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(57) Abstract 

The present invention relates to the design of trimeric polypeptides using polypeptide structural elements derived from the tetranectin 
protein family, and their use in rational de novo design and production of multi-^ctional molecules includmg the application of the 
multi-functional molecules in protem library technology, such as phage display technology, diagnostic and therapeuuc systems such as 
human gene therapy and imaging. The trimeric polypeptides being constructed as a monomer polypeptide construct compnsing at least one 
tetranectin trimerising strucUiral element (TTSE) which is covalently linked to at least one heterologous moiety, said TTSE being capable 
of fonning a stable complex with two other TTSEs; or as an oligomer which is comprised of two monomer polypeptide constructs as 
mentioned above, and which comproses three TTSEs or a multiplum of three TTSEs, or which is comprised of three monomer polypeptide 
constructs. 
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TRIMERISING MODULE 

The present invention relates to the design of trimeric 
polypeptides using polypeptide structural elements derived 
from the tetranectin protein family, and their use in 
5 rational de novo design and production of multi-functional 
molecules includng the application of the multi-functional 
molecules in protein library technology, such as phage 
display technology, diagnostic and therapeutic systems, such 
as human gene therapy and imaging. 

10 BACKGROUND OF THE INVENTION 

Tetranectin is a Ca^^ -binding trimeric C-type lectin which is 
present in blood plasma and from the extracellular matrix of 
certain tissues. The tetranectin group of proteins comprises 
tetranectin isolated from man and from mouse and the highly 
15 related C-type lectin homologues isolated from the cartilage 
of cattle (Neame and Boynton, database accession number 
PATCHX:u22298) and from reef shark (Neame et al . , 1992, Neame 
at al., 1996 and database accession number p26258 and 
PIR2:A37289) . 



20 The mature tetranectin polypeptide chain of 181 amino acid 
residues is encoded in three axons as shown by molecular 
cloning and characterisation of the gene (Berglund & Peter- 
sen, 1992; Wewer & Albrechtsen, 1992). Exon 3 of the human 
tetranectin gene encodes a separate functional and structural 

25 unit, a single long-form so-called carbohydrate recognition 
domain (CRD), with three intra-chain disulphide bridges. The 
tetranectin CRD is considered to belong to a distinct class 
of C-type lectins (Day, 1994) clearly related to C-type 
lectins by sequence homology, conservation of disulptiide 

30 topology (Puhlendorff et al, 1987) and by the presence of an 
almost conserved suit of amino acid residues known to be 
involved in binding of calcium ions . 



iMJeSmUTE SHEET (RULE 26) 
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A published poster (Holtet et al 1996) has proposed tetranec- 
tin to be a trimer and that trimerisation is governed by the 
peptide encoded by exon 1. The peptide encoded by exon 1 was 
proposed to be "necessary and sufficient to govern trimerisa- 
5 tion" whereas the polypeptide encoded by exon 2 was proposed 
as being "involved in lysine -sensitive binding to 
plasminogen" . 

Tetranectin was first identified as a plasma protein binding 
to plasminogen by binding to the kringle-4 domain of 

10 plasminogen. Recent unpublished results (Graversen et al., 
manuscript for PNAS) proves (1) that the site in tetranectin 
involved in binding to plasminogen resides entirely in the 
CRD-domain (encoded by exon 3), (2) that binding is calcium 
sensitive, and (3) that the kringle-4 binding site in 

15 tetranectin overlaps the putative carbohydrate binding site 

of the CRD domain. Hence, there is now surprising definitive • 
evidence that TN exons 1 and 2, i.e. the trimerisation unit 
in TN does not exihibit any plasminogen-binding affinity. 
Accordingly, an artificial protein containing a TTSE unit as 

20 part of its architecture is not expected to interact with 
plasminogen or plasrain due to properties inherited from 
tetranectin. 

Tetranectin has also been reported to bind to sulfated 
polysaccharides like heparin (Clemmensen (1989) Scand J. 

25 Clin. Lab. Invest, vol 49:719-725). We have new results 

showing that the CRD domains of tetranectin are not involved 
in this protein -polysaccharide interaction. In fact, the site 
in tetranectin is located in the N- terminal region of exon 1 
and may be abolished by removal or mutagenis of N-terminal 

30 lysine residues (Graversen et al., manuscript), processes 

that do not inhibit trimerisation. TTSEs that include most or 
all of TN exon 1 therefore confer an affinity for sulfated 
polysaccharides to any designed protein which encompasses 
such a TTSE as part of its structure. If desired, however, 

35 this affinity can be reduced or abolished by N-terminal 

truncation or mutagenesis of lysine residues in the part of 
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the TTSE that corresponds to the N-terminal 8-10 amino acid 
residues of exon 1 (Graversen et al . , unpublished) . 
With respect to gene therapy which is also withing the scope 
of the present invention, there is only a limited number of 
5 basic strategies for gene therapy which show some promise in 
preclinical models so far. The two major strategies e.g for 
the treatment of malignant tumors are cytokine -gene aided 
tumor vaccination and selective prodrug activation, whereas 
the first strategy relies on the strong immunostimulatory 

10 effect of a relatively small number of genetically modified 
cytotoxic T cells or tumor cells, the second one is based on 
conversion of a nontoxic prodrug into a toxic product by an 
enzyme -encoding gene where the toxic effect is exerted also 
on non- transduced dividing tumor cells due to a so-called 

15 bystander effect. Alternatively, strategies can be envisaged 
where the malignant phenotype of a cell is reversed by either 
inactivating an oncogene or reestablishing an inactivated 
tumor suppressor gene. In both cases, highly efficient gene 
transfer to the cells in a tumor is required. Although high 

20 efficiencies of gene transfer can be obtained in vitro and 
even in vivo under certain circumstances, correction of the 
malignant phenotype by reversing the major oncogenic change 
in the tumor cells is unlikely to result in normal cells. 
Thus, selective induction of tumor cell death by use of the 

25 present invention would be preferable, and the development of 
methods enabling such induction will be of great importance. 

A major problem in connection with the gene therapy is the 
incorporation of foreign material into the genome. Viruses, 
however, have only been partially successful in overcoming 

30 this problem. Hence the initial efforts at gene therapy are 
still directed towards engineering viruses so that they could 
be used as vectors to carry therapeutic genes into patients. 
In the still very immature in vivo method of somatic gene 
therapy, where a vector could be injected directly into the 

35 bloodstream,- or more preferably by transmucosal delivery, the 
present invention may be utilized due to the surprising 
number of ways the gene therapy may be targeted. 
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For many gene-therapy applications in the future, it is 
probable that a synthetic hybrid system will be used that 
incorporates engineered viral component for target-specific 
binding and core entry, immunosuppressive genes from various 
5 viruses and some mechanism that allows site-specific 

integration, perhaps utilizing AAV sequences or an engineered 
retroviral integrase protein. In addition, regulatory 
sequences from the target cell itself' will be utilized to" 
allow physiological control of expression of the inserted 
10 genes. All these components would be assembled in vitro in a 
liposome -like formulation with additional measures taken to 
reduce immunogenicity such as concealment by PEG 

As mentioned, one of the current problems in gene therapy is 
the efficient delivery of nucleic acids to as many as 

15 possible of a specific population of cells in the body, and 
it is often not possible to find e.g. an appropriate viral 
vector that will find that particular cell population 
efficiently and selectively {Review on aspects of gene 
therapy: Schaper, W & Ito, W.D. Current Opinion in 

20 Biotechnology, 1996, vol. 7, 635-640. Nature Biotechnology 
1998 vol 16 is an entire volume dedicated to protein- and 
gene delivery) . 

Given the possibility of in vitro generation of a human 
antibody against virtually any target antigen by phage 

25 technology, it follows that TTSEs, where one of the subunits 
is modified with a membrane integrating or associating 
entity, may be used as a practicable tool for generating a 
viral, bacterial or preferentially artificially assembled 
lipomal vehicle that will allow selective delivery of the 

30 contained material by infection or transfection of any cell 
population to which such a specific antibody may be 
generated. Moreover vehicles may, with the use of TTSEs, be 
individualised by selection of patient specific antibodies or 
by assembling TTSE units conjugated with scFvs selected from 

35 an ensemble of antibodies selected by the particular markers 
of the disease. 
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StJMMARY OF THE INVENTION 

It has surprisingly been found by the present inventors that 
the human tetranectin polypeptide (and derivatives thereof) 
is capable of forming very stable trimers which have a number 
5 of advantageous characteristics and uses. Notably, the tetra- 
nectin molecule includes a trimerising structural element 
which can be used as carrier of other chemical entities, 
thereby providing a carrier molecule of a hitherto unseen 
versatility. 

10 Prior published knowledge in the field of providing trimeri- 
sing polypeptides of choice includes the disclosure in WO 
95/31540 by Hoppe and Reid which describes a trimerisation 
module derived from collectin coiled coil structures and its 
application in engineering of artificially trimerised pro- 

15 teins. Several interesting areas of application are common to 
that patent publication and to the present disclosure. How- 
ever, in several ways the properties of the trimerisation 
modules derived from the tetranectin protein family as dis- 
closed herein are markedly different in fundamental architec- 

20 ture and represent surprisingly improved properties in com- 
parison with the collectin trimerisation unit: 

(1) Although the spatial structures of both trimerisation 
modules at a superficial level appear as similar in that both 
are ternary coiled coil structures of roughly equivalent • 

25 spatial size the stmactural basis for adopting this spatial 
configuration is markedly distinct between the two groups of 
proteins. In fact, it is so distinct that the common belief 
prior to the work of Holtet et al. on cross-linking of human 
tetranectin (Holtet et al . , 1996) was that this family of 

30 proteins were tetrameric (hence the name) . Accordingly; the 
sequences of the tetranectin family of trimerisation modules 
does not conform to the declared common motif delineated for 
the collectin family (WO 95/31540, page 8). 
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(2) The thermal stability of the tetranectin trimerisation 
module (as shown in the examples) is such that the trimer can 
be shown to exist even at about 60^C (Example 4, trimerised 
tetranectin) or at about 70°C (Example 3, trimerised ubiqui- 

5 tin) , whereas a collectin trimer unit falls apart at about 
50-55°C (WO 95/31540, Example 1, page 36 therein). 

(3) Whereas it remains uncertain whether the collectin trime- 
risation domain possibly allows attachment of fusion partners 
at C- terminal ends of the trimerisation module, and whereas 

10 no example has been reported of successful or claimed suc- 
cessful attachment of a foreign protein (except for the GST 
fusion partner) to the N- terminal region of the collectin 
trimerisation module, the information disclosed herein demon- 
strates that the tetranectin trimerisation module is more 

15 versatile in that is allows attachment of foreign proteins to 
either, as well as to both, terminus or termini simultaneous- 
ly (Examples 1-4) . This has important consequences as the 
tetranectin trimerisation module may be deployed to construct 
molecules that are able to interact (each end with a binding 

20 valency up to 3) simultaneously with two bulky interaction 
partners like e.g. cell surfaces. 

(4) The virtual absence of subunit exchange between monomers 
of a trimer that has been trimerised using the tetranectin 
trimerisation modules disclosed herein is by first principles 

25 of thermodynamics correlated with the surprisingly high 

thermal stability of the complex. It will hence be apparent 
that the advantages inherent to the "pick-and-mix" applica- 
tions of the technology, as disclosed herein, may be used to 
much greater advantage because of the much longer shelf life 

30 expected for the heterofunctional products of the present 
invention. 

The polypeptide constructs CIIH6FXTN123, H6FXTN123, H6FXTN12, 
and H6FXTN23 which all involve parts of the tetranectin 
molecule have previously been prepared (cf . e.g. WO 94/18227) 
35 but these constructs have all been provided with a view to 
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facilitating expression and/or purification of the tetranec- 
tin derived moiety of the constructs. To the best of the 
inventors' knowledge no publications exist which reports any 
use of tetranectin derivatives as "building blocks" to which 
5 other chemical moieties advantageously could be coupled. 

Hence, in its broadest aspect the present invention relates 
to a monomer polypeptide construct comprising at least one 
tetranectin trimerising structural element (hereinafter 
designated a TTSE) which is covalently linked to at least one 

10 heterologous moiety, said TTSE being capable of forming a 
stable complex with two other TTSEs, with the proviso that 
the heterologous moiety is different from any of the fusion 
proteins CIIH6FXTN123, H6FXTN123, H6FXTN12, H6FXTN23, the 
sequences of which are shown in SEQ ID NOs: 24-27. It is 

15 preferred that the heterologous moiety is one which does not 
exclusively facilitate expression and/or purification of the 
monomer polypeptide construct. 

The invention further relates to oligomeric molecules com- 

20 prised of at least two of such monomer polypeptide con- 
structs, and the invention also relates to methods of prepa- 
ring the monomer polypeptide constructs and the oligomers. 
The invention further relates to a kit comprising monomer 
polypeptide constructs in separate packages, ready for use in 

25 a "pick-and-mix" approach for use of the monomers. This pick 
and mix approach is for use in therapeutic as well as for 
dianostic use. Eg. having a tumor with a known and specific 
epitope to which an antibody is avaible. The kit may then 
comprise a first TTSE conjugated to an relevant antibody, a 

30 second component which may comprise af TTSE coupled to an 

imaging compound. In a second aspect the second component may 
comprise a drug or a prodrug relevant for treating the tumor. 
In a still further aspect a third TTSE component being a 
monotoring compound, e.g. indicative of the progress of the 

35 targeting. 



wo 98/56906 „ PCT/DK98/00245 

As is clear from this illustration the present invention 
relates to a large amount of combinations allowing an 
individul design for a large number of circumstances. 

Finally, the invention also pertains to fragments which 
5 include nucleic acid sequences which encode the monomer 
polypeptide constructs, as well as to vectors and cells 
containing these nucleic acid fragments. 

LEGENDS TO THE FIGURES 

Fig. 1: Amino acid sequence of the amino terminal region of 

10 tetranectin. 

Amino acid sequence (in one letter code) from El to L51 of 
tetranectin (SEQ ID NO: 7) . Exon 1 comprises residues El to 
D16 and exon 2 residues V17 to V49, respectively. The alpha 
helix extends beyond LSI to K52 which is the C-terminal amino 

15 acid residue in the alpha helix. 

Fig. 2: Alignment of the amino acid sequences of the trimeri- 
sing structural element of the tetranectin protein family. 
Amino acid sequences (one letter code) corresponding to 
residue V17 to K52 comprising exon 2 and the first three 

20 residues of exon 3 of human tetranectin (SEQ ID NO: 7) ; 
murine tetranectin (Sorensen et al., Gene, 152: 243 -245, 
1995) ; tetranectin homologous protein isolated from reef- 
shark cartilage (Neame and Boynton, 1992,1996); and tetranec- 
tin homologous protein isolated from bovine cartilage (Neame 

25 and Boynton, database accession number PATCHX:u22298) . Resi- 
dues at a and d positions in the heptad repeats are listed in 
boldface. The listed consensus sequence of the tetranectin 
protein family trimerising structural element comprise the 
residues present at a and d positions in the heptad repeats 

30 shown in the figure in addition to the other conserved resi- 
dues of the region, "hy" denotes an aliphatic hydrophobic 
residue . 
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Fig. 3: Construction of the expression plasmids pTHSFXtripa 
and pTH6FXtripb. 

The amplified DNA fragments tripa and tripb harbouring the 
tetranectin amino acid sequence (SEQ ID NO: 7) from El to T48 
5 and El to K52, respectively, fused in the 5' end to 

nucleotide sequences encoding a FX^ cleavage site IQGR (SEQ 
ID NO: 4) and the recognition sites for the restriction 
endonucleases Bglll and Kpnl, were cut with the restriction 
enzymes Bell and Hindi II and ligated into the BamHI and 
10 Hindlll sites of the expression plasmid pT7H6 (Christensen et 
al., 1991) using standard procedures . 

Fig. 4: Predicted amino acid sequence of the fusion proteins 
H6FXtripa (SEQ ID NO: 28) and H6FXtripb (SEQ ID NO: 29) 
encoded by the expression plasmids pTHSFXtripa and 
15 pTH6FXtripb, respectively. 

Fig. 5: Construction of the expression plasmids pTH6FXTN123 
and PTCIIH6FXTN123 . 

The amplified DNA fragment corresponding to the full length, 
mature tetranectin monomer (SEQ ID NO: 7) from El to V181 

20 fused in the 5' end to nucleotide sequences encoding a FXg 
cleavage site lEGR (SEQ ID NO: 10) was cut with the restric- 
tion enzymes BamHI and Hindlll and ligated into the corre- 
sponding sites of the expression plasmids pT7H6 (Christensen 
et al., 1991) and pTCIIHS using standard procedures. pTCIIHS 

25 was derived from pT7H6 by substitution of the Ndel - Hindlll 
fragment of pT7H6 with the Ndel - Hindlll fragment of pLcII 
(Nagai and Thogersen, 1987) encoding the first 32 residues of 
the lambda cll protein MVRANKRNEALRIESALLNKIAMLGTEKTAEG (SEQ 
ID NO: 11) fused in the 3' end to a nucleotide sequence 

30 encoding the H6 sequence GSHHHHHHGS (SEQ ID NO: 12) . 

Fig. 6: Predicted amino acid sequence of the fusion proteins 
H6FXTN123 (SEQ ID NO: 25) and CIIH6FXTN123 (SEQ ID NO: 24) 
encoded by the expression plasmids pTH6FXTN123 and 
pTCIIH6FXTN123 , respectively. 
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Fig. 7: Construction of the expression plasmids pTH6FXTN12, 
pTH6FXTN23, and pTH6FXTN3 . 

The amplified DNA fragments corresponding to the tetranectin 
derivatives TN12 and TN3 from El to V49 and A45 to V181, 
5 respectively (SEQ ID NO: 7) fused in the 5' end to nucleotide 
sequences encoding the FX^ cleavage site lEGR (SEQ ID NO: 10) 
was cut with the restriction enzymes BamHI and Hindlll and 
ligated into the corresponding sites of the expression plas- 
mids pT7H6 (Christensen et al . , 1991) using standard pro- 

10 cedures. The amplified DNA fragment corresponding to the 

tetranectin derivative TN23 from V17 to V181 (SEQ ID NO: 7) 
fused in the 5' end to nucleotide sequences encoding the FX^ 
cleavage site IQGR (SEQ ID NO: 4) was cut with the restric- 
tion enzymes BamHI and Hindlll and ligated into the corre- 

15 spending sites of the expression plasmids pT7H6 (Christensen 
et al., 1991) using standard procedures. 

Fig. 8: Predicted amino acid sequence of the fusion proteins 
H6FXTN12 (SEQ ID NO: 26), H6FXTN23 (SEQ ID NO: 27), and 
H6FXTN3 (SEQ ID NO: 30) encoded by the expression plasmids 
20 pTH6FXTN12, pTH6FXTN12, respectively. 

Fig. 9: Gel filtration analysis of TN123, TN23, and TN3 
Analytical gel filtration of the recombinant tetranectin 
derivatives TN123, TN23, and TN3 were performed on a Superose 
12 HR 10/30 column (Pharmacia, Sweden) with a total volume of 
25 25 ml in 100 mM NaCl and 50 mM Tris-HCl pH 8 and a flow rate 
of 0.2 ml/min. Vertical bars at peak maxima identify elution 
profiles for each of the three proteins. 

Fig. 10: Cross-linking analysis of TN123 and CIIH6FXTN123 . 
Samples of TN123, CIIH6FXTN123 and mixtures of both were 

30 incubated with DMSI and analyzed by SDS-PAGE (12% gel) . 

Before addition of DMSI, protein mixtures were subjected to 
sub\jnit exchange by incubation at 70 for varying length of 
time. Protein marker of 94, 68, 43 and 30 kDa, top to bottom 
(lane M) . CIIH6FXTN123 fusion protein (lane 1) . TN123 (lane 

35 2) . DMSI treated CIIH6FXTN123 (lanes 3 and 6) . DMSI -treated 
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TN123 (lane 4) . Identical samples of DMSI treated mixtures of 
CIIH6FXTN123 and TN123 without heat exposure (lanes 5 and 7) 
and heat treated for 2.5 sec, 15 sec, 2.5 min. and 10 min., 
respectively, before treatment with DMSI (lanes 8-11) . 

5 Fig- 11: Cross-linking analysis of the recombinant tetranec- 
tin derivatives TN123, TN23, TN3, and H6FXTN12 . 
The recombinant proteins TN123, TN23,TN3, H6FXTN12 or mix-" 
tures of TN123 and each of the other were analyzed by SDS- 
PAGE. Protein marker of 94, 68, 43, 30, 20, and 14.4 kDa, top 

10 to bottom (lane M) . TN123 cross-linked with DMSI (lane 1) . 
TN123 and H6-rTN12 cross-linked with DMSI without and with 
heat treatment at 70 °C for two min. (lanes 2 and 3) . H6FXTN12 
cross-linked with DMSI (lanes 4 and 5) . Mixture of TN123 and 
H6FXTN12, no cross -linking (lane 6) . Cross -linking of TN123 

15 and TN23 without and with heat treatment at 70**C for two min. 
(lanes 7 and 8) . Cross -linking of TN23 (lane 9) . Mixture of 
TN123 and TN23 without cross-linking (lane 10) . TN123 cross- 
linked by DMSI (lane 11) . Cross-linking of TN123 and TN3 
without and with heat treatment for two min. (lanes 12 and 

20 13) . Cross -linking of TN3 (Lane 14) . Mixture of TN123 and 
TN3, no cross-linking (lane 15). 

Fig. 12: Cross-linking based analysis of the trimer thermal 
stability. 

In parallel experiments TN123 and the fusion protein 
25 H6FXtripb-UB (SEQ ID NO: 31) were cross-linked with DMSI at 
different temperatures and the samples analyzed by SDS-PAGE. 
Protein marker of 94, 68, 43, 30, 20, and 14.4 kDa, top to 
bottom (lane M) . TN123 without cross-linking (lane 1) . TN123 
cross-linked with DMSI for 15 min. at 37«>C, SO^C, 60°C, and 
30 70<>C (lanes 2 to 5), respectively. The fusion protein 

H6FXtripb-UB (SEQ ID NO: 31) without cross-linking (lane 6) . 
H6FXtripb-UB cross-linked with DMSI for 15 min. at 37^0, 
50°C, 60°C, and 70^C (lanes 7 to 10), respectively and 
H6FXtripb-UB- incubated at 70°C for 15 min. (lane 11). 
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Fig. 13: Construction of the expression plastnid pTHSFXtripb- 
UB, 

The amplified DNA fragment comprising the nucleotide sequence 
(SEQ ID NO: 16) encoding the ubiquitin amino acid sequence 
5 (SEQ ID NO: 19) from Q2 to G76 was cut with the restriction 
enzymes Bell and Hindlil and ligated into the BamHI and 
Hindlll sites of the expression plasmid pT7H6FXtripb (Example 
1) using standard procedures. 

Fig. 14: Predicted amino acid sequence of the fusion protein 
10 H6FXtripb-UB (SEQ ID NO: 31) encoded by the expression 
plasmid pTH6FXtripb-UB. 

Fig. 15: Construction of the expression plasmid pTHSFXscFV 
{CEA6)tripb. 

The DNA fragment, amplified with the primer pair SEQ ID NOs: 
15 21 and 22, comprising the nucleotide sequence SEQ ID NO: 20 
encoding the single chain antibody CEA6, scFV (CEA6), amino 
acid sequence from Ql to A261 was cut with the restriction 
enzymes BamHI and Kpnl and ligated into the Bglll and Kpnl 
sites of the expression plasmid pT7H6FXtripb (Example 1) 
20 using standard procedures. 

Fig. 16 : Predicted amino acid sequence of the fusion protein 
H6FXscFV{CEA6) tripb encoded by the expression plasmid 
pHGFXscFV (CEA6 ) tripb . 

Fig. 17: Construction of the expression plasmid 

25 pTH6FXtripbs'cFX(CEA6) . 

The DNA fragment, amplified with the primer pairs having SEQ 
ID NO: 21 and 23, comprising the nucleotide sequence (SEQ ID 
NO: 20) encoding the single chain antibody CEA6, scFV (CEA6) , 
amino acid sequence from Ql to A261 was cut with the restric- 

30 tion enzymes BamHI and Hindi I I and ligated into the BamHI and 
Hindlll sites of the expression plasmid pT7H6FXtripb (Example 
1) using standard procedures. 
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Fig. 18: Predicted amino acid sequence of the fusion protein 
H6FXtripbscFv(CEA6) encoded by the expression plasmid 
pH6FXtripbscFv(CEA6) . 

Fig. 19: Construction of the expression plasmid 
5 pTH6FXscFv(CEA6) tripbscFX{CEA6) . 

The DNA fragment, amplified with the primer pair SEQ ID NO: 
21 and 23, comprising the nucleotide sequence (SEQ ID NO:" 20) 
encoding the single chain antibody CEA6, scFV (CEA6) , amino 
acid sequence from Ql to A261 was cut with the restriction 
10 enzymes BamHI and Hindlll and ligated into the BamHI and 
Hindu I sites of the expression plasmid 

pT7H6FXscFv(CEA6) tripb (Example 4) using standard procedures. 

Fig. 20: Predicted amino acid sequence of the fusion protein 
H6FXscFv(CEA6) tripbscFv(CEA6) (SEQ ID NO: 34) encoded by the 
15 expression plasmid pH6FXscFv(CEA6) tripbscFv(CEA6) . 

Fig. 21: Cross-linking analysis of the H6FXtripbscFv(CEA6) 
fusion protein (SEQ ID NO: 33) . 
In parallel experiments the fusion proteins 
H6FXtripbscFv(CEA6) (SEQ ID NO: 33) and TN123 were cross- 

20 linked at room temperature for 30 min. with 0 mg/ml, 0.5 

mg/ml, 1.0 mg/ml, 1.5 mg/ml, and 2.0 mg/ml of DMSI, respec- 
tively. Lane 1: H6FXtripbscFv(CEA6) without DMSI, 
H6FXtripbscFv(CEA6) with 0.5 mg/ml DMSI (lane 2), 
H6FXtripbscFv(CEA6) with 1.0 mg/ml DMSI (lane 3), 

25 H6FXtripbscFv(CEA6) with 1.5 mg/ml DMSI (lane 4) and 

H6FXtripbscFv{CEA6) with 2.0 mg/ml DMSI (lane 5). Protein 
marker of 94, 68, 43, 30, 20, and 14.4 kDa, top to bottom 
(lane M) . Lane 6: TN123 without DMSI, TN123 with 0.5 mg/ml 
DMSI (lane 7), TN123 with 1.0 mg/ml DMSI (lane 8) TN123 with 

30 1.5 mg/ml DMSI (lane 9) and TN123 with 2.0 mg/ml DMSI (lane 
10) . 



DETAILED DISCLOSURE OF THE INVENTION 
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The term "trimerising structural element" (TTSE) used in the 
present description and claims is intended to refer to the 
portion of a polypeptide molecule of the tetranectin family 
which is responsible for trimerisation between monomers of 
5 the tetranectin polypeptide. The term is also intended to 

embrace variants of a TTSE of a naturally occurring tetranec- 
tin family member, variants which have been modified in the 
amino acid sequence without adversely affecting, to any 
substantial degree, the trimerisation properties relative to 

10 those of the native tetranectin family member molecule. 
Specific examples of such variants will be described in 
detail herein, but it is generally preferred that the TTSE is 
derived from human tetranectin, murine tetranectin, C-type 
lectin of bovine cartilage, or C-type lectin of shark carti- 

15 lage. Especially preferred is monomer polypeptide constructs 
including at least one TTSE derived from human tetranectin. 

The 49 residue polypeptide sequence encoded by exons 1 and 2 
of tetranectin (Fig. 1) appears to be unique to the tetranec- 
tin group of proteins (Fig. 2) as no significant sequence 

20 homology to other known polypeptide sequences has been estab- 
lished. In preparation for experimental investigations of the 
architecture of tetranectin a collection of recombinant 
proteins was produced, the collection including complete 
tetranectin, the CRD domain (approximately corresponding to 

25 the polypeptide encoded by exon 3), a product corresponding 
to the polypeptide encoded by exons 2+3, a product correspon- 
ding to exons 1+2 (Holtet et ai., 1996; Example 2). As 
detailed in Example 2 we now know differently: tetranectin is 
indeed a trimer, but the exon 2 encoded polypeptide is in 

30 fact capable of effecting trimerisation by itself as evi- 
denced by the observation that the recombinant protein corre- 
sponding to exons 2+3 is in fact trimeric in solution. 

3D- structure analysis of crystals of full-length recombinant 
tetranectin (Nielsen et al . , 1996; Nielsen, 1996; Larsen et 
35 al., 1996; Kastrup, 1996) has shown that the polypeptide 
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encoded in axon 2 plus three residues encoded in axon 3 form 
a triple alpha helical coiled coil structure. 

From the combination of sequence and structure data it 
becomes clear that trimerisation in tetranectin is in fact 
5 generated by a structural element (Fig. 2), comprising the 
amino acid residues encoded by exon two and the first three 
residues of exon 3 by an unusual heptad repeat sequence, that 
apparently is unique to tetranectin and other members of its 
group: This amino acid sequence (Fig. 2) is characterised by 

10 two copies of heptad repeats (abcdefg) with hydrophobic 
residues at a and d positions as are other alpha helical 
coiled coils. These two heptad repeats are in sequence fol- 
lowed by an unusual third copy of the heptad repeat, where 
glutamine 44 and glutamine 47 not only substitute the 

15 hydrophobic residues at both the a and d position, but are 
directly involved in the formation of the triple alpha 
helical coiled coil structure. These heptad repeats are 
additionally flanked by two half -repeats with hydrophobic 
residues at the d and a position, respectively. 

20 The presence of beta-branched hydrophobic residues at a or d 
positions in alpha helical coiled coil are known to influence 
the state of oligomerisation. In the tetranectin structural 
element only one conserved valine (number 37) is present. At 
sequence position 29 in tetranectin no particular aliphatic 

25 residue appears to be preferred. 

In summary, it is apparent that the triple stranded coiled 
coil structure in tetranectin to a large extent is governed 
by interactions that are unexpected in relation to those 
characteristic among the group of known coiled coil proteins. 

30 The TTSEs form surprisingly stable trimeric molecules . 

(Examples 2, 3 and 4) . The experimental observations, that 
(1) a substantial part of the recombinant proteins exists in 
the oligomeric state of - and can be cross-linked as - 
trimeric molecules even at 70°C and (2) that exchange of 
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monomers between different trimers can only be detected after 
exposure to elevated temperature are evidence of a extremely 
high stability of the tetranectin trimerising structural 
element. This feature must be reflected in the amino acid 
5 sequence of the structural element. In particular, the pre- 
sence and position of the glutamine containing repeat in the 
sequential array of heptad repeats is, together with the 
presence and relative position of the other conserved resi- 
dues in the consensus sequence (Fig. 2), considered important 

10 for the formation of these stable trimeric molecules. For 

most practical uses the cysteine residue 50 should be mutage- 
nized to serine, threonine, methionine or to any other amino 
acid residue in order to avoid formation of an unwanted 
inter-chain disulphide bridge, which eventually would lead to 

15 uncontrolled multimerisation, aggregation and precipitation 
of a polypeptide product harbouring this sequence. 

In particular in conjunction with the trimer- stabilising exon 
1 encoded polypeptide (tetranectin residues 1 to 16, see 
Example 2), the tetranectin trimerising structural element is 

20 a truly autonomous polypeptide module retaining its struc- 
tural integrity and propensity to generate a highly stable 
homotrimeric complex whether it is attached or not by a 
peptide bond at either or at both termini to other proteins. 
This unique property is demonstrated in the accompanying 

25 examples, which provide experimental proof, that polypeptide 
sequences derived from heterologous proteins may. readily be 
trimerised when joined as fusion proteins to the tetranectin 
trimerising structural element. This remains valid irrespec- 
tive of whether the heterologous polypeptide sequences are 

30 placed amino- terminally or carboxy- terminally to the trimeri- 
sing element allowing for the formation of one molecular 
assembly containing up to six copies of one particular 
polypeptide sequence or functional entities, or the formation 
of one molecular assembly containing up to six different 

35 polypeptide sequences, each contributing their individual 
functional property. 
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Since three TTSEs of naturally occurring human tetranectin 
forms up a triple alpha helical coiled coil, it is preferred 
that the stable complex formed by the TTSEs of the invention 
also forms a triple alpha helical coiled coil. 

5 The "tetranectin family" are polypeptides which share the 
consensus sequence shown in Fig. 2 or a sequence which are 
homologous at sequence level with this consensus sequence. 
Hence, monomer polypeptide constructs of the invention are 
preferred which comprise a polypeptide sequence which has at 
10 least 68% sequence identity with the consensus sequence shown 
in Fig. 2, but higher sequence identities are preferred, such 
as at least 75%, at least 81%, at least 87%, and at least 
92%. 



By the term "heterologous moiety" is herein meant any chemi- 

15 cal entity which can be linked covalently to a TTSE and to 
which the TTSE is not natively covalently bound. Hence, the 
heterologous moiety can be any covalent partner moiety known 
in the art for providing desired binding, detection, or 
effector properties. The heterologous moiety can be a ligand 

20 binding structure such as a receptor molecule or the ligand 
binding part of a receptor molecule, an antibody, an antigen 
binding antibody fragment, or a molecule having antibody 
characteristics such as e.g. the "diabodies" described in 
EP-A-0 672 142, or other ligand binding molecules such as 

25 avidin or streptavidin, or a lectin; a toxin such as ricin; a 
detectable label such as a fluorescence labelled molecule, a 
radioactively labelled molecule, an enzymatically labelled 
molecule; an in situ activatable substance, such as a mo- 
lecule which can be induced by a magnetic field or by 

30 radiation to be radioactively or chemically active; an enzyme 
such as a peroxidase; a radioactive moiety such as a 7-, of-, 
jS"-, or jS^ -emitting molecule, e.g. a molecule comprising one 

14 3 32 33 

or more radioactive isotopes selected from C, H, P, P, 
25s, 3fis, ^^Cl, ^^Na, 2%a, ^^K, ^^k, "^^k, and any isotopes 
35 conventionally utilized for the purposes of facilitating 
detection of probes or the purposes of providing localized 
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radiation so as to effect cell death; a cytokine such as an 
interferon or a leukotriene; PNA; a non-proteinaceous polymer 
such as a polymeric alkaloid, 11. a polyalcohol, a 
polysaccharide, a lipid and a polyamine; a photo cross- 
5 linking moiety, i.e. a chemical entity which effects cross- 
linking upon photo -activation; and a. group facilitating 
conjugation of the monomer polypeptide construct to a target. 

The heterologous moiety is preferably covalently linked to 
the TTSE by via a peptide bond to the N- or C-terminus of the 

10 TTSE peptide chain, via a peptide bond to a side chain in the 
TTSE or via a bond to a cysteine residue, but any way of 
coupling covalently heterologous material to a polypeptide 
chain will be useful. The skilled person will know of such 
possibilities, e.g. by consulting the teachings of WO 

15 95/31540 in this regard which are hereby incorporated by 
reference . 

However, one interesting aspect of the invention relates to a 
monomer polypeptide construct of the invention comprising two 
heterologous moieties which are linked via peptide bonds to 

20 the N- and C-terminus, respectively. This approach introduces 
a number of possibilities in terms of e.g. linking larger 
entities with oligomers of the invention by having specific 
activities coupled to each end of the monomers (as explained 
in detail below, the oligomers of the invention may also 

25 utilise a version of this principle, where e.g. one N-termi- 
nus and one C-terminus of an oligomer are linked via peptide 
bonds to independent heterologous moieties) . 

In general, a complex between two or three monomers are 
described in the following way: three monomers having one 

30 TTSE each forms a trimer designated (1+1+1), whereas a dimer 
formed between a monomer with two TTSEs and a monomer .with 
one TTSE is designated (1+2) . Other (undesired) trimers can 
of course be formed, e.g. (2+2+1), where two TTSEs are not 
"in use", but it is preferred that the oligomers of the 

35 invention use all of their available TTSEs during complex 
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formation. It should also be noted that the term "monomer 
polypeptide construct" is meant to designate a single 
polypeptide chain which may or may not have non-peptide 
groups coupled covalently to the polypeptide chain, whereas 
5 "dimeric polypeptide" or "dimer", "trimeric polypeptide" or 
"trimer" and "oligomer" (i.e. a dimer or trimer) in the 
present context are meant to designate non-covalent complexes 
of monomer polypeptide constructs. I.e., the traditional 
definitions of monomers and multimers do not apply in the 
10 present specification and claims. 

The TTSE as exemplified by exon 2 or exons 1 and 2 of human 
tetranectin, preferably so modified to allow only hetero- 
trimerisation between dissimilar (1+1+1) or (1+2) (cf . the 
below discussion) subunits may be deployed as a general 

15 affinity mediator, which can be coupled chemically to each 
member of a selection of target molecules. After such conju- 
gation with TTSE the target molecules may be homo- or hetero- 
trimerised as desired for any application. Similar deployment 
of dimerisation, using as one partner a polypeptide harbour- 

20 ing two TTSE sequences in-line, separated by a linker 

sequence of suitable length and character, may bet yet more 
advantageous, as in such case absolute control of 
stoichiometry in complex formation would be possible. Thus, 
an important embodiment of the invention is a monomer 

25 polypeptide construct of the invention comprising 2 TTSEs 
which are covalently linked by a spacer moiety which allows 
both of the 2 TTSEs to take part in complex formation with a 
third TTSE not being part of the monomer polypeptide con- 
struct, but equally important is the embodiment of the inven- 

30 tion where the monomer polypeptide construct comprises one 
single TTSE, so as to allow trimerisation between three 
monomers and hence providing the optimum degree of versati- 
lity with respect to the number of functional units wjiich can 
be easily incorporated into one single complex. 



35 In the embodiments where two TTSEs are present in the same 
monomer it is preferred that the spacer moiety has a length 
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and a conformation which favours complex formation involving 
both of the two TTSEs which are covalently linked by the 
spacer moiety. In this way, problems arising from undesired 
formation of trimers of the formats (2+1+1), (2+2+2), and 
5 (2+2+1) (wherein only one TTSE of each monomer participates 
in complex formation) can be diminished. Design and prepara- 
tion of suitable spacer moieties are known in the art and are 
conveniently effected by preparing fusion polypeptides having 
the format TTSE^-Spacer-TTSE^, where the spacer moiety is a 
10 polypeptide fragment (often a relatively inert one) , so as to 
avoid undesired reactions between the spacer and the sur- 
roundings or the TTSEs. 

One typical scenario, where such modification may be advan- 
tageous is the case of immunological detection where a chemi- 
15 cal conjugate of an antibody with enzymes such as peroxidase 
is used for in situ staining purposes in tissue or on western 
blots . 

A similar, but yet different, application example would be 
the deployment of TTSE to mediate conjugation of e.g. alka- 

20 line phosphatase and an oligonucleotide which would allow in 
situ identification of a given mRNA in a tissue sample 
concurrently with identification of any other mRNA molecule 
e.g. by interconnection of a second appropriate oligonucleo- 
tide and a signalling/visualisation molecule using e.g. the 

25 biotin - avidin/streptavidin affinity pair for conjugation. 
The point of having two or more selective affinity systems 
available for conjugating oligonucleotide probes and detector 
molecules is that as many different sequences may be detected 
simultaneously as there are affinity sets available. 

30 In terms of chemistry required to exploit TTSE as a conjuga- 
ting affinity-contributing agent, the peptide corresponding 
to exon 2 will have a sufficient affinity for most purposes, 
but incorporation of all, or some segment of the exon 1 
polypeptide will serve to increase affinity and stability. 

35 The properties of tetranectin mutants in which many of hydro- 
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philic (e.g. lys and glu) residues that are largely exterior 
in the coiled coil structure have been replaced with alanine 
appear similar to the native protein, suggesting that is 
indeed possible without interfering very much with stability 
5 of the trimeric structure to replace all glu, asp and lys 

residues by a combination of gin, asn, arg or ala, and there- 
by generate a sequence that, as an N- terminally blocked 
synthetic peptide, would be very easy to convert into a 
chemically stable active-ester component, e.g. an N-hydroxy 

10 succinimide ester of an acetylated peptide, that could react 
with (and thereby couple to) any exposed lysine side chain in 
a target molecule of interest. Such peptide synthesis, acti- 
vation and coupling chemistry will be readily designed and 
applied by a person skilled in the art of peptide chemistry, 

15 as will indeed any other conjugation chemistry, like the 
attachment and use of photo- act ivatable moieties like e,g. 
phenyl azides . 

In conclusion, it seems that the most important structure in 
native TTSE is the consensus sequence shown in Fig. 2, and 

20 that large variations in the polypeptide chain may be 

allowed. Hence, one advantageous embodiment of the monomer 
polypeptide construct of the invention is one where at least 
one amino acid residue selected from the group consisting of 
amino acid residue nos . 6, 21, 22, 24, 25, 27, 28, 31, 32, 

25 35, 39, 41, 42, is/are substituted by any non-helix breaking 
amino acid residue, the amino acid residue numbering refer- 
ring to amino acid residues in SEQ ID NO: 7. All these resi- 
dues have been shown not to be directly involved in the 
intermolecular interactions which stabilises the trimeric 

30 complex between three TTSEs of native tetranectin monomers 
and it is therefore expected that these amino acids may be 
safely substituted with any amino acid which will not have an 
adverse effect on helix formation (notably proline, which 
introduces a rigid bend in a polypeptide chain) . 



35 Another advantageous embodiment of the monomer polypeptide 

construct of the invention is one which is free from any free 
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amino and/or carboxy groups. This would favour synthesis of a 
TTSE by means of solid or liquid phase peptide synthesis, 
since there would be no need of introducing any protective 
groups during peptide synthesis. 

5 Since the consensus sequence of Fig. -2 is believed important 
and since this consensus sequence embraces the above-dis- 
cussed heptad repeat, it is according to the invention pre- 
ferred that the TTSE comprises a repeated heptad having the 
formula a-b-c-d-e-f -g (N to C) , wherein residues a and d 

10 generally are hydrophobic amino acids. However, since "a'« and 
"d" in the third of the complete heptads of all known members 
of the tetranectin family are constituted of glutamine, it is 
most preferred that the TTSE comprises the heptad repeated 3 
times and that the last occurrence of the heptad has a 

15 glutamine residue corresponding to residues a and d. 

Since exon 2 of native members of the tetranectin family 
seems to contain the necessary elements to effect stable 
trimerisation, it is preferred that the monomer polypeptide 
construct is free of substantial parts of tetranectin which 

20 is encoded by exon 3 and/or lacks substantial parts of tetra- 
nectin which is encoded by exon 1. However, since exon 1 
encoded material seems to stabilise the trimeric native 
tetranectin, it is especially preferred that all or part of 
exon 1 is part of the monomer polypeptide construct, and it 

25 also seems to be rational to include the first three amino 
acids encoded by exon 3, since these are known to take part 
of the formation of the native triple alpha helical coiled 
coil in human tetranectin. 

One particularly interesting embodiment of the invention is 
30 the possibility of designing oriented molecular assemblies, 
where one or more functional entities are located N- termin- 
ally to the trimerising element and one or more functional 
entities are located C- terminally to the element. Such types 
of design may be particularly advantageous where a certain 
35 relative ratio is desired among the different functional 
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entities included in a specific molecular unit. Such type of 
design may in addition be used if one or more functional 
entities for either structural or functional reasons appear 
incompatible within the same construct. Such may be the case 
5 if one or more of the functional entities are expressed by 
large or bulky protein domains which for steric reasons might 
prevent formation of the trimeric molecular unit due to 
sterical constraints. 

The possibility of constructing bi -polar three-way fusion 
proteins in which one functionality is placed N- terminally to 
the coiled coil structure and a different functionality is 
placed C-terminally is additionally advantageous in applica- 
tions where large spatial separation between the two 
functionalities are desirable for optimal function. Examples 
of such application are e.g. the deployment of binding 
domains (e.g. antibody-derived binding modules) for recog- 
nition and binding to binding sites located at or close to 
large structures like cell membranes in cases where it is 
advantageous to allow for binding of the other end of the 
trimerised molecule to a different, but also bulky target. 

Hence, as discussed above , the oligomers of the invention may 
be used to join e.g. bulky surfaces by the oligomer according 
to the invention comprising at least one heterologous moiety 
which is positioned N- terminally to a TTSE and at least one 
25 heterologous moiety which is positioned C-terminally to a 

TTSE. The two heterologous moieties can be either part of the 
same monomer polypeptide construct or parts of two separate 
monomer polypeptide constructs . 

The extraordinarily high stability of any trimer containing 
30 the tetranectin trimerisation module under physiological 

buffer and temperature condition (i.e. absence of deuaturant, 
temperature not exceeding 40<>C) in combination with the ease 
by which exchange of monomer subunits between trimers can be 
effected by incubation at moderately elevated temperature or 
35 in the presence of denaturants provide for a unique opportu- 
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nity to deploy the trimerisation module as a vehicle to allow 
the construction of "pick-and-mix" conjugates prepared from 
previously fabricated collections of horaotriraeric molecules. 
To illustrate the versatility of this design opportunity by 
5 way of theoretical example, let us assume that (1) a collec- 
tion of twenty different antibody constructs (e.g. in the 
format of single-chain Fv) each of its own characteristic 
binding specificity, has been selected and then turned into 
homo-trimeric molecules by fusion to a tetranectin trimerisa- 
10 tion module, and let us also assume that a set of twenty 

different effector molecules (e.g. toxin domains) have simi- 
larly been prepared and also conjugated to the tetranectin 
trimerisation module. A user provided with prefabricated 
collections of twenty different antibody constructs and 
15 twenty different toxin constructs - 40 different reagents in 
all - has the opportunity then to prepare 400 different 
toxin -antibody conjugates, simply by mixing a first preferred 
component from one reagent collection with a second preferred 
reagent from the other collection and then subject this 
20 binary mixture to conditions, i.e. gentle heating or incuba- 
tion with a suitable level of denaturant, to accomplish 
subunit exchange among all trimeric molecular species in the 
mixture. After the subunit exchange step the desired hetero- 
bifunctional reagent will be present in the mixture as a 
25 major component of the mixture and may then be deployed as 
such to accomplish a given purpose or, alternatively apply a 
simple purification step to isolate his favoured hetero- 
functional binary reagent from any remaining mono -functional 
trimer species by a simple standard protein purification 
30 step, easily designed using standard techniques known in the 
field of protein purification. 

A further enhancement of the versatility of the "pick-and- 
mix" technology may be achieved by including a specific 
affinity purification tag on each array of trimerisation 
35 module - probe/effector/indicator conjugate, fused directly 
in-line or, alternatively, fused via a cleavable linker (a 
polypeptide segment containing e.g. a factor or an 
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enterokinase recognition/cleavage site) to the affinity tag. 
More specifically, if each of three libraries were tagged 
with affinity handles a, b and c, respectively, that were 
recognised by binding siibstances A, B and C, respectively, 
5 pure heterotrimers, composed of one member of each library, 
could be obtained in a three -step affinity purification 
procedure designed to allow selective recovery of only such 
trimers that exhibit affinity for substances A and B and C. 
If, for any reason, subsequent removal of the affinity tags 

10 were desirable, and the constructs had been prepared to 

include cleavable linkers, isolation of the pure heterotri- 
mer, liberated from all affinity tags, could be accomplished 
by three further affinity purification steps, arranged to 
isolate only material that would bind to neither substance A 

15 nor substance B nor substance C. 



Obviously, the scope of "pick-and-mix" design of user-prepar- 
able heterofunctional complexes apply not only to the forma- 
tion of binary hetero- functionality, but would apply by logic 
extension to the formation of ternary hetero- functionality : 

20 To envisage the wealth of possibilities that are inherent to 
the concept of ternary hetero- functionality in a further 
theoretical example along the lines given above, three sets 
of reagent collections, each comprising 20 different func- 
tional characteristics, i.e. a collection of in toto 60 

25 different homotrimers would allow "pick-and-mix" preparation 
of 8,000 different tri- functional molecules. 

The basic tetranectin trimerisation module will, essentially 
indiscriminately, form homo- and hetero-trimers with any 
molecule that also contains a trimerisation module. For some 

30 applications it may be advantageous to have available 

specially engineered derivatives of the tetranectin trimeri- 
sation module, which have been reengineered to disallqw homo- 
trimer formation and hence only allow hetero- trimerisation. 
Thus, an important embodiment of the monomer polypeptide 

35 construct of the invention is constructed/reengineered so as 
to disfavour formation of complexes between identical TTSEs; 
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this also has the implication that oligomers of the invention 
can advantageously be comprised of monomer polypeptide con- 
structs which are designed so as to disfavour formation of 
trimers including two monomer polypeptide constructs having 
i identical TTSEs. One way of disfavouring the formation of 
homotremerisation would be by "knobs .into holes" mutagenisis. 



The design/reengineering may be accomplished by introduction 
of amino acid substitution at sites in the polypeptide inti- 
mately involved in the formation and stability of the trimer 

10 and, simultaneously, in a different construct introduce a 
compensatory amino acid substitution, all in all removing 
symmetry between individual monomer components of the triple 
helical structure so that the structural complementarity 
profile only allows the formation of hetero-trimers, but is 

15 incompatible with some or each of the homotrimer species. 

A yet different way to deploy the tetranectin trimerisation 
module as a vehicle to accomplish rational formation of 
bifunctionalisation would require the interconnection of the 
C-terminus of one monomer to the N-terminus of a second 

20 monomer in the triple-helical structure. The basic require- 
ment for such an intervening polypeptide is, that allowed 
spatial distances between its N- and C-termini must be com- 
patible with the spacing inherent to the structural require- 
ments given by the architecture of the tetranectin trimerisa- 

25 tion module. The construction of an intervening connecting 
polypeptide allowed according to such criteria would be 
readily accomplished by an average person skilled in the art 
of protein engineering, as an ample collection of examples of 
the deployment of, usually flexible, spacer sequences are 

30 known both in nature and in designed proteins. Due to the 
expected entropic contribution to interaction energy in a 
molecule in which two of the three tetranectin trimerisation 
module components are covalently tied together, such a mo- 
lecule would show great preference for selecting any molecule 

35 containing only a single copy of the tetranectin trimerisa- 
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tion module component, as this selection would be energeti- 
cally favoured. Hence, conjugation of one functional protein 
component to a suitably selected covalently dimerised tetra- 
nectin triraerisation module component and conjugation of a 
5 different functional protein component to a single-copy 

element of the trimerisation sequence would provide for the 
preferential formation of a 1:1 bifunctional complex and 
suppression of formation of any other complex. 

The monomers of the invention may be prepared by methods 
generally known in the art, using exclusively or in combina- 
tion the techniques of recombinant protein production, 
peptide synthesis (liquid phase or solid phase) , and tradi- 
tional chemical coupling of heterologous moieties to a 
peptide chain or to specific residues therein. Hence the 
invention also relates to a method of preparing the monomer 
polypeptide construct of the invention, the method comprising 

isolating the monomer polypeptide construct from a cul- 
ture comprising a host cell which carries and expresses a 
nucleic acid fragment which encodes the monomer 
20 polypeptide construct, 

synthesizing, by means of chemical peptide synthesis, the 
monomer polypeptide construct and subsequently isolating 
the monomer polypeptide construct from the reaction 
mixture , 

preparing a TTSE in a culture comprising a host cell 
which carries and expresses a nucleic acid fragment which 
encodes the TTSE, subsequently linking covalently at 
least one heterologous moiety to the TTSE, and thereafter 
isolating the resulting monomer polypeptide construct, or 

synthesizing, by means of chemical peptide synthesis, a 
TTSE, subsequently linking covalently at least one 
heterologous moiety to the TTSE, and thereafter the 
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isolating the resulting monomer polypeptide construct 
from the reaction mixture, 

and optionally siobjecting the monomer polypeptide construct 
to further processing. 

5 The nucleic acid fragment which is mentioned above is also a 
part of the invention and is defined as a nucleic acid frag- 
ment in isolated form which encodes a TTSE as defined herein 
or which encodes the polypeptide part of a monomer 
polypeptide construct according to the invention, with the 

10 proviso that the nucleic acid fragment is different from one 
that encodes native members of the tetranectin family, and 
that the nucleic acid fragment is different from one that 
encodes any of the fusion proteins CIIH6FXTN123, H6FXTN123, 
H6FXTN12, H6FXTN23, the sequences of which are shown in SEQ 

15 ID NOS: 24-27. 

The above mentioned host cell (which is also a part of the 
invention) can be prepared by traditional genetic engineering 
techniques which comprises inserting a nucleic acid fragment 
(normally a DNA fragment) encoding the polypeptide part of a 

20 monomer polypeptide construct of the invention into a suit- 
able expression vector, transforming a suitable host cell 
with the vector, and culturing the host cell under conditions 
allowing expression of the polypeptide part of the monomer 
polypeptide construct. The nucleic acid fragment encoding the 

25 polypeptide may be placed under the control of a suitable 
promoter which may be inducible or a constitutive promoter. 
Depending on the expression system, the polypeptide may be 
recovered from the extracellular phase, the periplasm or from 
the cytoplasm of the host cell. 

30 Suitable vector systems and host cells are well-known .in the 
art as evidenced by the vast amount of literature and 
materials available to the skilled person. Since the present 
invention also relates to the use of the nucleic acid frag- 
ments of the invention in the construction of vectors and in 
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host cells, the following provides a general discussion 
relating to such use and the particular considerations in 
practising this aspect of the invention. 

In general, of course, prokaryotes are preferred for the 
5 initial cloning of nucleic sequences of the invention and 
constructing the vectors useful in the invention. For 
example, in addition to the particular strains mentioned in 
the more specific disclosure below, one may mention by way of 
example, strains such as E. coli K12 strain 294 (ATCC No. 
10 31446), E. coli B, and £. coli X 1776 (ATCC No. 31537). These 
examples are, of course, intended to be illustrative rather 
than limiting. 

Prokaryotes are also preferred for expression, since effi- 
cient purification and protein refolding strategies are 
15 available. The aforementioned strains, as well as E, coli 
W3110 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli 
such as Bacillus subtilis, or other enterobacteriaceae such 
as Salmonella typhimurium or Serratia marcesans, and various 
Pseudomonas species may be used. 

20 In general, plasmid vectors containing replicon and control 
sequences which are derived from species compatible with the 
host cell are used in connection with these hosts. The vector 
ordinarily carries a replication site, as well as marking 
sequences which are capable of providing phenotypic selection 

25 in transformed cells. For example, E. coli is typically 

transformed using pBR322, a plasmid derived from an E. coli 
species (see, e.g.. Bolivar et al., 1977). The pBR322 plasmid 
contains genes for ampicillin and tetracycline resistance and 
thus provides easy means for identifying transformed cells. 

30 The pBR plasmid, or other microbial plasmid or phage must 

also contain, or be modified to contain, promoters which can 
be used by the microorganism for expression. 

Those promoters most commonly used in recombinant DNA con- 
struction include the B-lactamase (penicillinase) and lactose 
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promoter systems (Chang et al . , 1978; Itakura et al., 1977; 
Goeddel et al . , 1979) and a tryptophan (trp) promoter system 
(Goeddel et al., 1979; EPO Appl. Publ. No. 0036776). While 
these are the most commonly used, other microbial promoters 
5 have been discovered and utilized, and details concerning 
their nucleotide sequences have been published, enabling a 
skilled worker to ligate them functionally with plasmid 
vectors (Siebwenlist et al., 1980). Certain genes from proka- 
ryotes may be expressed efficiently in E. coli from their, own 
10 promoter sequences, precluding the need for addition of 
another promoter by artificial means. 



In addition to prokaryotes, eukaryotic microbes, such as 
yeast cultures may also be used. Saccharomyces cerevisiase, 
or common baker's yeast is the most commonly used among 

15 eukaryotic microorganisms, although a number of other strains 
are commonly available. For expression in Saccharomyces, the 
plasmid YRp7, for example, is commonly used (Stinchcomb et 
al., 1979; Kingsman et al., 1979; Tschemper et al., 1980). 
This plasmid already contains the trpl gene which provides a 

20 selection marker for a mutant strain of yeast lacking the 
ability to grow in tryptophan for example ATCC No. 44076 or 
PEP4-1 (Jones, 1977) . The presence of the trpl lesion as a 
characteristic of the yeast host cell genome then provides an 
effective environment for detecting transformation by growth 

25 in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include the 
promoters for 3 -phosphoglycerate kinase (Hitzman et al., 
1980) or other glycolytic enzymes (Hess et al., 1968; Holland 
et al., 1978), such as enolase, glyceraldehyde- 3 -phosphate 

30 dehydrogenase, hexokinase, pyruvate decarboxylase, phospho- 
fructokinase, glucose -6 -phosphate isomerase, 3 -phosphoglyce- 
rate mutase, pyruvate kinase, triosephosphate isomerase, 
phosphoglucose isomerase, and glucokinase. In constructing 
suitable expression plasmids, the termination sequences 

35 associated with these genes are also ligated into the expres- 
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sipn vector 3' of the sequence desired to be expressed to 
provide polyadenylation of the mRNA and termination. 

Other promoters, which have the additional advantage of 
transcription controlled by growth conditions are the promo- 
5 ter region for alcohol dehydrogenase. 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen 
metabolism, and the aforementioned glyceraldehyde -3 -phosphate 
dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Any plasmid vector containing a yeast- 
10 compatible promoter, origin of replication and termination 
sequences is suitable. 

In addition to microorganisms, cultures of cells derived from 
multicellular organisms may also be used as hosts. In prin- 
ciple, any such cell culture is workable, whether from verte- 

15 brate or invertebrate culture. However, interest has been 
greatest in vertebrate cells, and propagation of vertebrate 
in culture (tissue culture) has become a routine procedure in 
recent years (Tissue Culture, 1973). Examples of such useful 
host cell lines are VERO and HeLa cells, Chinese hamster 

20 ovary (CHO) cell lines, and W13 8, BHK, COS -7 293 and MDCK 
cell lines. 

Expression vectors for such cells ordinarily include (if 
necessary) an origin of replication, a promoter located in 
front of the gene to be expressed, along with any necessary 
25 ribosome binding sites, RNA splice sites, polyadenylation 
site, and transcriptional terminator sequences. 

For use in mammalian cells, the control functions on the 
expression vectors are often provided by viral material. For 
example, commonly used promoters are derived from polyoma, 
30 Adenovirus 2, and most frequently Simian Virus 40 (SV40) . The 
early and late promoters of SV40 virus are particularly 
useful because both are obtained easily from the virus as a 
fragment which also contains the SV4 0 viral origin of repli- 
cation (Fiers et al . , 1978). Smaller or larger SV40 fragments 
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may also be used, provided there is included the approximate- 
ly 250 bp sequence extending from the Hindlll site toward the 
Bgll site located in the viral origin of replication. Fur- 
ther, it is also possible, and often desirable, to utilize 
5 promoter or control sequences normally associated with the 
desired gene sequence, provided such, control sequences are 
compatible with the host cell systems. 

An origin of replication may be provided either by construc- 
tion of the vector to include an exogenous origin, such as 
10 may be derived from SV40 or other viral (e.g., Polyoma, 
Adeno, VSV, BPV) or may be provided by the host cell 
chromosomal replication mechanism. If the vector is inte- 
grated into the host cell chromosome, the latter is often 
sufficient . 

15 Upon production of the polypeptide monomer constructs it may 
be necessary to process the polypeptides further, e.g. by 
introducing non-proteinaceous functions in the polypeptide, 
by subjecting the material to suitable refolding conditions 
(e.g. by using the generally applicable strategies suggested 

20 in WO 94/18227) , or by cleaving off undesired peptide moie- 
ties of the monomer (e.g. expression enhancing peptide frag- 
ments which are undesired in the end product) . 

In the light of the above discussion, the methods for recom- 
binantly producing the monomer polypeptide construct of the 

25 invention are also a part of the invention, as are the 
vectors carrying and/or being capable of replicating the 
nucleic acids according to the invention in a host cell or a 
cell-line. According to the invention the expression vector 
can be e.g. a plasmid, a cosmid, a minichromosome , or a 

30 phage. Especially interesting are vectors which are inte- 
grated in the host cell/cell line genome after introduction 
in the host. 

Another part of the invention are transformed cells (useful 
in the above -described methods) carrying and capable of 
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replicating the nucleic acid fragments of the invention; the 
host cell can be a microorganism such as a bacterium, a 
yeast, or a protozoan, or a cell derived from a multicellular 
organism such as a fungus, an insect cell, a plant cell, or a 
5 mammalian cell. Especially interesting are cells from the 

bacterial species Escherichia, Bacillus and Salmonella, and a 
preferred bacterium is E. coli. 

Yet another part of the invention relates to a stable cell 
line producing the polypeptide part of a monomer polypeptide 
10 construct according to the invention, and preferably the cell 
line carries and expresses a nucleic acid of the invention. 

On the basis of the abovis discussions it will be clear to the 
skilled person that also the oligomers resulting from the 
complex formation between the monomer constructs of the 

15 invention are important parts of the invention. Hence the 

invention also pertains to an oligomer which is comprised of 
two monomer polypeptide constructs according to the invention 
which comprises at least three TTSEs, or which is comprised 
of three monomer polypeptide constructs according to the 

20 invention which each only contain one single TTSE. 

As is explained herein and shown in the examples, the oligo- 
mers of the invention are stable at temperatures up to 70^C 
and it is therefore especially preferred that the oligomers 
of the invention are stable at temperatures above physiologi- 
25 cal ones, e.g. that the oligomers are stable in the tempera- 
ture range 50-70°C. 

Also a part of the invention is a method for preparing a 
dimeric oligomer of the invention which comprises 

admixing a monomer polypeptide construct which includes 
30 two TTSEs (construct 1) with a monomer polypeptide con- 

struct which includes only one TTSE (construct 2) , 
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effecting the two TTSE's of construct 1 to complex with 
the TTSE of construct 2 (this can be done by thermal 
treatment, i.e. heating to a temperature which ensures 
denaturation followed by subsequent cooling allowing 
5 renaturation, or this can be done by dena- 

turing/renaturing effected by changes in the chemical 
environment) , and 

isolating the resulting dimer and optionally subjecting 
the dimer to further processing (cf . the above discussion 
10 of further processing, but it should also be mentioned 

that the further processing could include non-covalent 
coupling of interesting and relevant moieties to the 
dimeric oligomer) . 

Consequently, the method for producing a trimeric oligomer is 
15 also a part of the invention and comprises the steps of 

admixing three monomer polypeptide constructs of the 
invention with each other, 

effecting complex formation between one TTSE of each 
monomer polypeptide construct, and 

20 - isolating the resulting trimer and optionally subjecting 
the trimeric oligomer to further processing. 

The considerations applying to complex formation and further 
processing mentioned above apply to this method also. 

In view of the detailed discussion above of the "pick-and- 
25 mix" aspect of the invention, the invention also pertains to 
a kit comprising 

a first package comprising at least one container means, 
each at least one container means containing a monomer 
polypeptide construct of the invention, 
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a second package comprising at least one container means, 
each at least one container means in the second package 
containing a monomer polypeptide construct of the inven- 
tion, the second package being different from the first 
5 package with respect to choice and/or number of monomer 

polypeptide constructs included therein, and optionally 

a third package comprising at least one container means, 
each at least one container means in the third package, 
containing a monomer polypeptide construct of the inven- 
10 tion, the second package being different from the first 

and second packages with respect to choice and/or number 
of monomer polypeptide constructs included therein. 

It is preferred that the at least one container means in each 
package contains mutually distinct monomer polypeptide con- 
15 structs, and it is especially preferred that all container 
means comprised in the kit comprises mutually distinct 
polypeptide constructs. 

A very important aspect of the invention is the possibility 
of generating a system designed especially for the individual 
20 circumstances. The basic idea is that the artificial 

selection of heterologous moieties and optionally active 
components, and functional entities result in a unique system 
as will be further disclosed in the following. 

25 Using the TTSE as^a vehicle for assembling monovalent scFv or 
Fab antibody fragments into oligomeric and multivalent 
entities offer design advantages also in terms of generating 
chimaeric artificial antibodies having desirable 
pharmacokinetic and pharmacodynamic properties. Small 

30 derivatives like monomeric scFv fragments or bivalent 

"minibodies" are rapidly cleared from the circulatory system, 
whereas complete Igs stay for very much longer. Conversely, 
small derivatives like scFv and minbodies exhibit better 
extravasation properties. It is therefore expected that 

35 antibodies of a desired specificity may be optimized for 
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particular diagnostic or therapeutic needs by engineering the 
pharmacological properties, using the TTSE as a vehicle for 
controlled oligomerization of e.g. scFv fragments. 

One example of such engineering would be the requirements for 
5 delivering a high dose of an imaging- or toxin-conjugated 
antibody to a tumour, while ensuring as low a systemic 
exposure or imaging background as possible. In such case a 
TTSE conjugated scFv fragment could be designed to exhibit 
strong multivalent binding to the tumour and rapid clearance 
10 of excess conjugate from circulation. 

Accordingly, in af further aspect the present invention also 
relates to the use of a monomer polypeptide construct or to a 
an oligomer according to the present invention as a vehicle 
for assembling antibody fragments into oligomeric or 
15 multivalent entities for generating chimeric artificial 
antibodies having preselected pharmacokinetic and/or 
pharmadynamic properties . 

The use of specific delivery systems also play an important 
role in connection with the present invention in that such 
20 systems may by utilized with respect to different use of the 
present invention both with respect to the a more general 
therapeutic application and with respect to gene therapy. 
Exampels of suitable drug delivery and targeting systems are 
disclosed in Nature 392 supp. (30 april 1998). 

25 Accordingly, efficiency of deliverance may be further 

increased if the delivery system e.g a liposome is supplied 
with a molecular unit, an "infector or transfector" ligand, 
recognized by a internalizing receptor unit specific for the 
target cells. For example, cells displaying endocytotic 

30 receptors like members of the LDL family of receptors may be 
even more efficiently infected or transfected either by 
including a TTSE unit in the antibody containing heteorimer 
or in an independent TTSE unit conjugated to one or more of 
the domains of the Receptor Associated Protein, RAP, 
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(Ellgaard, L., Holtet, T.L., Nielsen, P.R., Etzerodt, M., 
Gliemarm, J. & Thogersen, H.C. Eur J Biochem. 1997, vol 244, 
544-551) which is recognized as a ligand to all receptors in 
this abundant family of endocytosis -mediating receptors. 

5 Accordingly, in a further aspect, the invention is directod 
to the use of a monomer polypeptide construct or to an 
oligomer according to the invention for targeted gene therapy 
involving selective delivery of the material for transfection 
or infection of the specific population of cells. 

10 The ultimate perspective of such TTSE-mediated gene therapy 
would be the deployment of a viral vector that would find no 
other targets in the patient but the cells displaying the 
artificial receptor complex. 

In a still further aspect, the invention is directod to the 
15 use of a monomer polypeptide construct or to a an oligomer 

construct according to the invention wherein the at least one 
heterologous moity comprises a moiety selected from a ligand 
binding structure such as a receptor molecule or the ligand 
binding part of a receptor molecule, and wherein the gene 
20 therapy involves the delivery of nucleic acids to the desired 
population of cells by use of a viral vector directed to 
cells displaying the artificial receptor complex 
corresponding to the heterologous moity. 

In another aspect, the invention is directod to the use of a 
25 monomer polypeptide construct or to a an oligomer construct 
according to the invention wherein at least one TTSE is 
modifyed with a membrane integrating or associating entity 
having affinity to the specific population of cells in the 
body relevant for the gene therapy. 

3 0 Furthermore, a recent review of the imaging an therapeutic 
potential of a range of known antibody derivatives has been 
published by Paul Carter and Margaret Merchant of Genentech 
Inc. (Current Opinion in Biotechnology, 1997, vol 8, 449- 
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454) . In direct continuation of their conclusions it will be 
apparent that oligomersation of antibody derivatives like 
scFv derivatives may extend current technology in the 
designer-antibody field in many important aspects, some of 
5 which will be elaborated below (with reference to the Carter 
& Merchant review) . 

One of the well-known problems inherent to mouse monoclonal 
antibodies that have been 'humanized' by grafting of the 
murine antigen combining site onto a human Ig framework is 

10 that antigenicity of the chimaeric product in human patients 
is often difficult to suppress entirely, resulting in - 
sometimes life-threatening - immune reactions to the 
diagnostic or therapeutic humanized antibody product. The 
risk of such side-effects are expected to be much reduced if 

15 the designer antibody is assembled from purely human proteins 
or protein fragments. Since the TTSE trimerising unit 
described here is identical to a portion of human tetranectin 
that is already present in human plasma and tissue, there is 
good reason to expect that the TTSE will not elicit an 

20 antigenic response in a human subject if it is introduced as 
a component of a chimaeric product that is not otherwise 
antigenic in humans . 

Accordingly, in one aspect, the present invention relates to 
the use of a monomer polypeptide construct or to a an 
25 oligomer according to the present invention as a component of 
a chimaeric product having low antigenicity in humans 
relative to formulations comprising on or more components of 
non-human origin. 

Carter & Merchant further review present technology for 
30 radiolabelling of antibody derivatives. Again, 

oligomerisation using TTSEs offer more elegant solutions to 
problems associated with labelling, as the TTSE offers the 
possibility to construct one or two of the TTSE monomer units 
in a heterotrimeric complex to harbour the site carrying the 
35 label. Thus, in this format labelling may also be confined to 
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the non-antibody part of the complex, leaving the antigen- 
binding module entirely unmodified, and the complex may 
furthermore be formulated "in the field" as and when needed. 

In many receptor-mediated signal transduction pathways 
5 signals are triggered by the clustering of receptor molecules 
on the cell membrane. The TTSEs therefore have important 
applications in the study and exploitation of receptor 
signalling, as ligands may be presented as oligomers by 
conjugation to a TTSE unit. 

10 This also has important application in phage display 

technologies for discovering new ligands and new receptors as 
the engineering of a TTSE unit fused inline to a candidate 
ligand molecule will allow the display of a hetero-trimeric 
phage coat protein, in which only one of the monomer units is 

15 conjugated to the phage coat protein. This may be 

accomplished by appropriate insertion of amber codons at the 
site of fusion of phage coat protein to the TTSE-ligand 
segment of the three-way fusion protein encoded by the 
recombinant phage. In appropriate E. coli cells the presence 

20 of this amber codon will result in translation termination in 
the majority of read-throughs, and hence most of the fusion 
protein product secreted to the periplasmic compartment in 
the phage -infected bacterium will be soluble TTSE-ligand 
fusion protein, whereas a minority of the fusion protein will 

25 also contain a phage protein module. The majority of t rimers 
that will be generated will therefore contain, at most, one 
monomeric unit that will ensure integration (display) in the 
mature recombinant phage particle. 

A further advantage of the display tecnology discribed above 
30 relates to the fact that it is specially useful for selection 
on the basis of a relatively low affinity because of the 
entropic benefit contribution obtained by the proximity of 
the tree binding moities in confined spatial arrangement. 
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Accordingly, the present invention in an important aspect, 
also relates to protein library technology wherein the TTSE's 
discribed above are utilized. 

The trimerisation of candidated recombinant ligands is 
5 especially important as, for many receptors, the 

intracellular signal is induced by receptor clustering, which 
is only brought about if the external ligand exhibits ' 
multivalent binding to the receptor, so as to bridge two or 
more receptor molecules. 

10 In one preferred embodiment the monomer polypeptide construe 
or the oligomer construct according to the invention is for 
targeted gene therapy involving selective delivery of the 
material for transfection or infection of the specific 
population of cells. Tthe at least one heterologous moity may 

15 comprise a moiety selected from a ligand binding structure 
such as a receptor molecule or the ligand binding part of a 
receptor molecule, and wherein the gene therapy involves the 
delivery of nucleic acids to the desired population of cells 
by use of a viral vector directed to cells displaying the 

20 artificial receptor complex corresponding to the heterologous 
moity. 

As mentioned above, it is an important aspect of the 
invention that the monomer polypeptide construct and/or the 
oligomer may be used as a component of a chimaeric product 
25 having low antigenicity in humans. As the construe is of 

human origin it is believed that the antigenicity in humans 
is low relative to formulations comprising on or more 
components of non-human origin. 

One primary use of a monomer polypeptide construct or a an 
30 oligomer according to the invention is for delivering an 
imaging or toxin -conjugated antibody to a target such as a 
tumor, or use as a vehicle delivering an substance to a 
target cell or tissue, as a vehicle for assembling antibody 
fragments into oligomeric or multivalent entities for 
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generating chimeric artificial antibodies having preselected 
pharmacokinetic and/or pharmadynamic properties. 

The substance in question being one or more selected from the 
group of heterologous moities as well a pharmaceutical. Also 
5 a labelled construct wherein the label is coupled to one or 
to of the TTSE monomer units is within the scope of the 
invention. 

As explained in detail previously, an important and 
surprising use of the monomer polypeptide construct or the 
10 oligomer according to the pesenent invention is for protein 
library technology, such as phage display technology. The 
present invention also relates to any polynucleotid molecule 
such as a RNA, DNA or PNA as well as any vector encoding one 
or more TTSE. 

15 A further use according to invention includes preparation and 
use of a pharmaceutical composition comprising the TTSE 
construct and optionally a pharmaceutical ly acceptable 
excipient. The composition may be administered by a route 
selected from the group consisting of the intraveneous route, 

20 the intraarterial route, the transmembraneus route of the 

buccal, anal, vaginal or conjunctival tissue, the intranasal 
route, the pulmonary route, the transdermal route, the 
intramuscular routed, subcutaneous route, intratechal route, 
inoculation into tissue such as a tumour, or by an implant. 

25 The the monomer polypeptide construct or the oligomer is in a 
preferred embodiment comprised in a liposome. 

It is obvious from the disclosure of the present invention 
that the treating or preventing of a disease may by a further 
aspect comprising administering to the subject in need 
30 thereof an effective amount of a pharmaceutical composition 
referred to above. 
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In one aspect of the various possibilities according to the 
present invention concerning how the human gene therapy is 
targeted, includes the case wherein at least one TTSE is 
modified with a membrane integrating or associating entity 
5 having affinity to the specific population of cells in the 
body relevant for the gene therapy. 

As used in the conventional pharmaceutic field the present 
invention includes a method wherein the monomer polypeptide 
construct or the oligomer is administered by a route selected 

10 from the group consisting of the intraveneous route, the 

intraarterial route, the transmembraneus route of the buccal, 
anal og vaginal tissue, intranasal route, the pulmonary 
route, the transdermal route, intramuscular, subcutaneous, 
intratechal, the buccal, inoculation into tissue such as a 

15 tumour, or by an implant. 

Finally, the present invention is also relating to the field 
of dianosing as the skilled person would easyly recognice, 
that the TTSE disclosen in the present inventiion may also 
refer to a method for diagnosis comprising a construct 
20 comprising the monomer polypeptide constnact or the oligomer, 
together with a diagnosing component coupled thereon. 

EXAMPLE 1 

Design and construction of the pTH6trip E. coli expression 
vectors for the production of trimerised chimeric fusion 
25 proteins. 

The plasmid clone pT7H6FXTN123 (Example 2) was used as tem- 
plate for amplification in two Polymerase Chain Reactions 
(PGR) (Saiki et al . , 1988) with the primer pairs trip-N (SEQ 
ID NO: 1) and trip-Ca (SEQ ID NO: 2) and trip-N (SEQ ID NO: 
30 1) and trip-Cb (SEQ ID NO: 3), respectively. The amplified 
DNA fragments, trijoa, comprising nucleotide sequences enco- 
ding an IQGR cleavage site for the restriction protease FX^^ 
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(SEQ ID NO: 4) followed by two sites for the restriction 
nucleases Bglll and Kpnl, the nucleotide sequence encoding 
the tetranectin polypeptide sequence for Glu 1 to Lys 52 (SEQ 
ID NO: 5) followed by recognition sites for the three re- 
5 striction endonucleases BamHI, Hindlll, and EcoRI, respec- 
tively, and tripb , comprising nucleotide sequences encoding 
an IQGR cleavage site for the restriction protease FX^ (SEQ 
ID NO: 4) followed by two sites for the restriction nucleases 
Bglll and Kpnl, the nucleotide sequence encoding the tetra- 
10 nectin polypeptide sequence for Glu 1 to Val 49 (SEQ ID NO: 
6) followed by recognition sites for the three restriction 
endonucleases BamHI, Hindlll, and EcoRI, respectively, were 
subcloned into the plasmid pT7H6 (Christensen et al . , 1991), 
yielding pTtripa and pTtripb, respectively (Figs. 3 and 4) . 

15 EXAMPLE 2 

Tetranectin, localisation of the trimerising structural 
element and stability of the triple alpha helical coiled 
coil . 

The cDNA encoding the reading frame corresponding to the 

20 mature tetranectin single chain (SEQ ID NO: 7) was cloned by 
specific amplification in a Polymerase Chain Reaction (PGR) 
(Saiki et al . , 1988) of the nucleotide sequences from amino 
acid residue Glul to Vall81 using 1^^ strand oligo-dT primed 
cDNA synthesized from total human placental RNA as template. 

25 Primers used in the PGR were SEQ ID NO: 8 and SEQ ID NO: 9. 
RNA extraction and cDNA synthesis were performed using stan- 
dard procedures. The amplified reading frame encoding the 
monomer subunit of tetranectin was at the 5' -end, via the 
PGR- reaction, linked to nucleotide sequences encoding the 

30 amino acid sequence SEQ ID NO: 10 which constitute an lEGR 
cleavage site for the bovine restriction protease FX^ (Nagai, 
and Thagersen, 1987) . A glycine residue was, due to the 
specific design of the 5' -PGR primer (SEQ. ID NO. 8), 
inserted between the G- terminal arginine residue of the FX^ 

35 cleavage site (SEQ ID NO. 10) and the tetranectin Glul-resi- 
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due. The amplified DNA fragment was subcloned into the E. 
coli expression vector pT7Hg (Christensen at al., 1991) 
producing the plasmid pT7HgFX-TN123 expressing the tetranec- 
tin monomer H6FXTN123 (SEQ ID NO: 25) and into pT7CIIHg, 

5 which is a derivative of pTj^S' where the amino -terminal 32 
amino acid residues of the lambda CII protein (SEQ ID NO. 11) 
are inserted 5' of the six histidine residues (SEQ ID NO. 12) 
as outlined in Fig. 5, yielding PT7CIIH6FX-TNI23 expressing 
the tetranectin fusion protein CIIH6FXTN123 (SEQ ID NO: 24). 

10 The amino acid sequence of the expressed proteins are shown 
in Fig. 6 (in SEQ ID NO: 7 is given the amino acid sequence 
of the mature tetranectin protein) . Furthermore three addi- 
tional derivatives of tetranectin were constructed (Fig. 8): 
H6FXTN12 comprising the tetranectin amino acid residues Glul 

15 to Val49 (SEQ ID NO: 6) , H6FXTN23 comprising the tetranectin 
amino acid residues Vall7 to VallSl (SEQ ID NO: 7) , and 
H6FXTN3 (SEQ ID NO: 30) comprising the tetranectin amino acid 
residues Ala45 to VallSl (SEQ ID NO: 7). These three tetra- 
nectin derivatives were constructed by specific amplification 

20 in a PGR using pT7HgFX-TN123 as template and the primer-pairs 
SEQ ID NO: 8 and SEQ ID NO: 13, SEQ ID NO: 14 and SEQ ID NO: 
9, and SEQ ID NO: 15 and SEQ ID NO: 9, respectively. The 
amplified DNA fragments were subcloned into the E. coli 
expression vector pT7Hg producing the plasmids pT7HgFX-TN12 , 

25 pT7HgFX-TN23, and pT7HgFX-TN3 , respectively (Fig. 7), 

To prepare recombinant tetranectin and its derivatives, each 
of the plasmids PT7H6FX-TNI23 , pT7CIIHgFX-TN123 , PT7H6FX-TNI2, 
pT7HgFX-TN23, and pT7HgFX-TN3 were grown in medium scale (4 x 
1 litre; 2xTY medium, 5 mM MgS04 and 100 fig ampicillin) in E. 
30 coli BL21 cells, as described by Studier et ai . (1990). 
Exponentially growing cultures at 37*^0 were at OD^oo 0-8 
infected with bacteriophage lambda CE6 at a multiplicity of 
approximately 5. Cultures were grown at 3 7<>C for another 
three hours and the cells harvested by centrifugation. 

35 Cells were resuspended in 150 ml of 0 . 5 M NaCl, 10 mM 

Tris-HCl pH 8, and 1 mM EDTA pH 8. Phenol (100 ml adjusted to 
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pH 8) was added and the mixture sonicated to extract the 
total protein. Protein was precipitated from the phenol phase 
by 2.5 volumes of ethanol and centrif ugation. 

The protein pellet was dissolved in a buffer containing 6M 
5 guanidinium chloride, 50 mM Tris-HCL pH 8 and 0.1 M dithio- 
erythriol. Following gel filtration on Sephadex G-25 (Pharma- 
cia, Sweden) into 8 M Urea, 1 M NaCl, 50 mM Tris-HCl pH 8 and 
10 mM 2-mercaptoethanol, the crude protein preparation wa3 
applied to a Ni^-" activated NTA-agarose column (Ni^^NTA-aga- 
10 rose, 75 ml pre-washed with 8 M urea, 1 M NaCl, 50 mM 

Tris-HCl pH 8, and 10 mM 2-mercaptoethanol) for purification 
(Hochuli et al . , 1988) and refolding of the fusion proteins, 
H6FXTN123, CIIH6FXTN123, H6FXTN12, H6FXTN23, and H6FXTN3 . 

For this study we chose to prepare our own Ni^+NTA- agarose 
15 matrix. A carbodiimide coupling of the N- (5-amino-l-carboxy- 
pentyl) iminodiacetic acid metal ligand (synthesis route as 
described by Dobeli & Hochuli (EP-A-0 253 303)) to a rigid 
agarose matrix (Sepharose CL-6B, Pharmacia, Sweden) was 
performed: 

20 8 g of N- (5-amino-l-carboxypentyl) iminodiacetic acid from the 
synthesis procedure in 50 ml was adjusted to pH 10 by addi- 
tion of 29 g of Na2CO3(10 H2O) and added to a stirred suspen- 
sion of activated Sepharose CL-6B in 1 M Na2C03 . Reaction was 
allowed overnight. The Sepharose CL-6B (initially 100 ml 

25 suspension), was activated after removal of water by acetone 
with 7 g of 1,1' -carbonyldiimidazol under stirring for 15 to 
30 min. Upon activation the Sepharose CL-6B was washed with 
acetone followed by water and 1 M Na2C03. 

The NTA-agarose matrix was loaded into a column and "charged" 
30 with Ni^"*" by slowly passing through 5 column volumes of a 10% 
NiS04 solution. The amount of Ni^^ on the NTA-agarose matrix, 
prepared by "this procedure, has been determined to 14 /imol 
per ml matrix. After charging the Ni^^'NTA- agarose column was 
washed with two column volumes of water, one column volume of 
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1 M Tris-HCl pH 8 and two column volumes of loading buffer 
before stirred mixing of the Ni^^NTA- agarose matrix with the 
crude protein extracts in a breaker for 15 to 30 min. All 
buffers prepared for liquid chromatography were degassed 
5 under vacuum prior to addition of reductant and/or use. 

The Ni^^NTA-agarose matrix - crude extract mixture was packed 
in standard glass columns for liquid chromatography (internal 
diameter: 2.6 cm) to a volume of approximately 4 0 ml. The 
columns were washed with 200 ml of 8 M urea, 1 M NaCl, 50 mM 

10 Tris-HCl pH 8, and 10 mM 2-mercaptoethanol (Buffer I) and 100 
ml 6 M guanidinium chloride, 50 mM Tris-HCl pH 8 and 10 mM 
2-mercaptoethanol (Buffer II) and the adsorbed tetranectin 
derived fusion proteins H6FXTN123, H6CIIFXTN123 , H6FXTN23, 
and H6FXTN3 refolded using the cyclic refolding procedure as 

15 described (Thagersen et al., WO 94/18227). 

The fusion protein H6FXTN12 was refolded by removing the 
guanidinium chloride and 2-mercaptoethanol of buffer II in a 
gradient over 5 column volumes into 50 mM Tris-HCl pH 8 and 
0.5 M NaCl. After completion of the refolding procedures the 

20 tetranectin derived fusion proteins were eluted from the 

Ni^'*"NTA-agarose columns with a buffer containing 0.5 M NaCl, 
50 mM Tris-HCl, 25 mM EDTA pH 8. The tetranectin fusion 
proteins H6FXTN123, H6FXTN23, and H6FXTN3 were cleaved with 
FX^ at 4°C overnight in a molar ratio of 1:300. After FX^ 

25 cleavage the protein samples were concentrated 10. fold by 
ultrafiltration on YMIO membranes (Amicon) . After ten times 
dilution of the protein sample with 2 mM CaCl2i the recombi- 
nant tetranectin derivatives TN123, TN23, and TN3 were iso- 
lated by ion-exchange chromatography on Q-Sepharose (Pharma- 

30 cia, Sweden) in a linear gradient over 10 column volumes from 
10 mM Tris-HCl pH 8, 2 mM CaCl2 to 10 mM Tris-HCl pH 8, 2 mM 
CaCl2, and 0,5 M NaCl. After elution from the Ni^'^NTA- agarose 
columns the fusion proteins H6CIIFXTN123 and H6FXTN12 were 
likewise concentrated 10 fold by ultrafiltration on YMIO 

35 membranes and gelfiltrated into buffer containing 25 mM Tris- 
HCl pH 8, 25 mM NaCl, and 2 mM CaCl2, before purification of 
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correctly folded monomer by ion-exchange chromatography on 
Q-Sepharose as described. 

Recombinant full length tetranectin (TN123) produced by these 
procedures have been analyzed with respect to binding to 
5 plasminogen kringle 4 and immobilised fucoidan, expression of 
antigenic sites, and localization of disulphide bridges. In 
all criteria tested the produced TN123 behaved identically to 
isolated naturally human tetranectin (data not shown) . Fur- 
thermore TN123 and TN3 have been crystallized (Kastrup et 
10 al.,1996) and the structure has also been determined, all of 
which bear evidence that a single unique and biologically 
active folded product had indeed been produced. 

Analytical gelfiltration analysis of rTN proteins. 

Analytical gelfiltration of the recombinant tetranectin 
15 derivatives TN123, TN3, and TN23 (Pig. 9) were performed on a 
Superose 12 HR 10/30 column (Pharmacia, Sweden) with a total 
volume of 25 ml in 100 mM NaCl and 50 mM Tris-HCl pH 8 and a 
flow rate of 0.2 ml/min. The K^^ value is defined by, 
Ka^=(Ve-Vo)/(Vc-Vo) . 

20 The gelfiltration analysis of TN123 and TN23 show that both 
proteins are exclusively found as t rimers in solution (K^^ 
values of 0.27 and 0.29, respectively), whereas TN3 appeared 
monomeric (K3^^:0.41). 

Chemical cross-linking of tetranectin and derivatives 

25 The recombinant tetranectin derivatives TN123, TN3, and TN23, 
together with the fusion proteins CIIH6FXTN123 and H6FXTN12 
or mixtures of these derivatives at 1 mg/ml concentrations in 
cross-linking buffer (0.1 M Sodium borate, pH 9.1) were 
incubated with dimethyl suberimidate (DMSI, Sigma) . 10 /il 

30 aliquots of protein solution were incubated with 1 fil ali- 
quots of DMSI stock solution (20 mg/ml in cross-linking buf- 
fer) for 30 minutes at 25°C before addition of 2 fil quenching 
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buffer (3 M Tris-HCl, pH 9) . Subunit exchange between pre- 
formed homo-oligomers was induced by subjecting protein 
mixtures to heat shock treatment- Five /xl aliquots of each 
protein solution (1 mg/ml stocks) were mixed at O^C in stan- 
5 dard polypropylene microcentrifuge tubes, transferred to a 
water bath at 70**C for the time spans- indicated, and then 
further incubated for 15 minutes at 25°C before reaction with 
DMSI. 

10 Prior to analysis by SDS-PAGE (12% gels) of. the cross -linked 
products the reaction samples were boiled in the presence of 
SDS and mercaptoethanol . 

Cross -linking analysis of TN123 and the fusion protein 
CIIH6FXTN123 showed that no detectable subunit exchange 

15 between pre -formed homo-oligomers in a mixture of TN123 and 
CIIH6FXTN123 was found after 16 hours at room temperature 
(Fig. 10) . Subunit exchange could be induced by incubating 
the protein mixture at 70<>C for 15 seconds or longer before 
cooling to room temperature and addition of DMSI. SDS-PAGE 

20 analysis showed the presence of four trimer bands above 95 
kDa (corresponding to two homo-trimers and two hetero-tri- 
mers) and three dimer bands (corresponding to two homo-dimers 
and one hetero- dimer) in the gel between 43 and 55 kDa, in a 
relative abundance in agreement with random association of 

25 monomer subunits into trimers after subunit exchange. It 

should be noted, that molecular weight markers have only been 
included on the SDS-PAGE gels for crude calibration and 
orientation of the gels. 

The trimeric organization of tetranectin was further corro- 
30 borated by cross-linking studies of the proteins H6FXTN12, 

TN23, and TN3 and mixtures between them (Fig. 11). The tetra- 
nectin derivative TN3, containing only the CRD, could not be 
cross -linked even at high protein concentrations and did not 
interfere with the cross-linking of rTN123 . Likewise, the 
35 derivative TN23, containing exon 2 and the CRD, appeared 

monomeric after cross-linking and was found not to interfere 
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with trimerisation of TN123 during subunit exchange. Dimeric 
TN23 molecules found at low abundance in the sample probably 
reflects contaminating misfolded disulphide bridged dimers. 
The fusion protein H6FXTN12 formed homo-trimers upon cross- 
5 linking and generated hetero-trimers with TN123 after subunit 
exchange. Because of the difference in size of full length 
tetranectin (TN123) and H6FXTN12 the possible nine protein 
bands resulting from chemical cross-linking are: The four 
trimers [(TN123)3, (TN123) 2 (H6FXTN12) , (TN123 ) (H6FXTN12) 2 , 
10 and (H6FXTN12)3] at approx. 95 kDa , 50 kDa, 37 kDa, and 20 
kDa, respectively; the three dimers [{TN123)2, 
(TN123) (H6FXTN12) , and (H6FXTN12)2] at approx. 45 kDa, 30 
kDa, and 15 kDa, respectively; and the two monomers TN123 at 
23 kDa and H6FXTN12 at 9 kDa. 

15 Taken together, the gel filtration and the cross -linking 

analysis of the tetranectin derivatives show that tetranec- 
tin, like the collectin group of C-type lectins, is a 
trimeric molecule and that amino acid residues directly shown 
to be involved in trimerisation of the tetranectin monomer 

20 are located in exon 2 of the protein {Vall7 - Val49) . Fur- 
thermore subunit exchange between the trimeric molecules 
could only be observed after heat denaturation . Amino acid 
residues Glul to AsplG of tetranectin are critical to chemi- 
cal cross -linking with DMSI and more important appear to 

25 stabilize the trimeric molecule because the cross-linking 

analysis of the mixture TN123 and TN23 showed no decrease in 
TN123 formation after heat denaturation and possible subunit 
exchange (Fig. 11) . The stability of the tetranectin trimer 
was corroborated by a cross -linking analysis with DMSI at 

30 different temperatures. Fifteen /xl TN123 at 0.3 mg/ml concen- 
tration was pre -incubated 10 min, at either 27 ''C, 50*>C, SO^'C, 
or 70 °C before addition of 2 fil DMSI (20 mg/ml) . The reaction 
was allowed to proceed for 15 min. before reaction was 
quenched with 5 /xl of 3M Tris-HCl pH 9 . 1 and the reaction 

35 mixtures allowed to cool to room temperature. SDS-PAGE analy- 
sis of reduced samples (Fig. 12) showed, that trimers are 
readily detectable even at 60^C, although a competing pattern 
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of cross -linking specimens increases at increasing, tempera- 
tures. The appearance of other cross-linking specimens is 
probably due to the unfolding of the CRD. The stability of 
the tetranectin trimerising structural element is further 
5 analyzed using a designed chimeric protein in Example 3. 

EXAMPLE 3 



Design and construction of the recombinant chimeric protein 
TRIPB-UB - the tetranectin trimerising structural element and 
ubiquitin, 

10 A plasmid clone, pLCMHF/UB, generously provided by Dr. 0. 

Wiborg harbouring a human ubiquitin cDNA insert (SEQ ID: 16) 
was used as template and SEQ ID NO: 17 together with SEQ ID 
NO: 18 were used for amplification in a Polymerase Chain 
Reaction (PGR) (Saiki et al., 1988) of the nucleotide 

15 sequence encoding amino acid residue Ilel to Gly76 of human 
ubiquitin (SEQ ID:^ 19) . The amplified DNA fragment was after 
digestion with the restriction endonucleases BamHI and 
Hindlll ligated into the BamHI and Hindlll sites of pTtripb 
(Example 1) yielding pTtripb-UB (Fig. 13) using standard 

20 procedures. 

To prepare the chimeric fusion protein H6FXtripb-UB (Fig. 14, 
SEQ ID NO: 31) the plasmid pTtripb-UB was grown in medium 
scale (4x1 litre; 2xTY medium, 5 mM MgS04 and 100 fig 
ampicillin) in E. coli BL21 cells, as described by Studier et 

25 al. (1990). Exponentially growing cultures at 31^C were at 
ODgoo 0.8 infected with bacteriophage lambda CE6 at a multi- 
plicity of approximately 5. Cultures were grown at 37«>C for 
another three hours and the cells harvested by 
centrifugation. Cells were resuspended in 150 ml of 0.5 M 

30 NaCl, 10 mM Tris-HCl pH 8, and 1 mM EDTA pH 8. Phenol (100 ml 
adjusted to pH 8) was added and the mixture sonicated to 
extract the total protein. Protein was precipitated from the 
phenol phase by 2.5 volumes of ethanol and centrifugation. 
The protein pellet was dissolved in a buffer containing 6 M 
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guanidinium chloride, 50 mM Tris-HCl pH 8 and 0.1 M dithio- 
erythriol. Following gel filtration on Sephadex G-25 (Pharma- 
cia, Sweden) into 8 M Urea, 1 M NaCl, 50 raM Tris-HCl pH 8 and 
10 mM 2-mercaptoethanol, the crude protein preparation was 
5 applied to a Ni^"" activated NTA- agarose column for purifi- 
cation (Hochuli et al . , 1988) and refolding of the fusion 
protein H6FXtripb-UB. 

Synthesis and charging of the Ni^"" activated NTA-agarose 
matrix is described in Example 2. All buffers for liquid . 

10 chromatography were degassed prior to use. The fusion protein 
H6FXtripb-UB was refolded by removing the urea and 2-mercap- 
toethanol from buffer II in a gradient over 5 column volumes 
into 50 mM Tris-HCl pH 8 and 0.5 M NaCl . After completion of 
the refolding procedure the H6FXtripb-UB fusion protein was 

15 eluted from the Ni^^'NTA- agarose columns with a buffer con- 
taining 0.5 M NaCl, 50 mM Tris-HCl, 25 mM EDTA pH 8 and gel 
filtrated on a Sephadex G50 column (Pharmacia) into 0.1 M 
Sodium borate pH 9 buffer for chemical cross -linking analysis 
with DMSI. 

20 The cross-linking analysis experiment was designed both to 
analyze the oligomeric status of the chimeric fusion protein 
and the thermal stability of the presumed fusion protein 
trimer as described in Example 2: Samples of 15 /xl H6FXtripb- 
UB fusion protein, at approximately 1.0 mg/ml concentration, 

25 were pre-incubated 10 min. at either 37°C, SO'^C, 60°C, or 
70«C before addition of 2 ^1 DMSI (20 mg/ml) . The reactions 
were allowed to proceed for 15 min. before quenching by 
addition of 5 ^il of 3 M Tris-HCl pH9.1 and the reaction 
mixtures were allowed to cool to room temperature. SDS-PAGE 

30 analysis of reduced samples (Fig. 12) showed, (1) that the 
fusion protein H6FXtripb-UB is a trimer in solution (monomer 
at 17 kDa, dimer at 35 kDa, and trimer at 43 kDa) and (2) 
that a substantial amount of trimer molecules is present even 
at 70 ^C. The appearance of other larger cross -linking pro- 

35 ducts is probably due to cross -linking of trimers via the 
ubiquitin part of the fusion protein. 
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EXAMPLE 4 

Design and construction of trimerised and hexamerized CEA6 
scFv antibodies scFvlCEAS) -TRIPE, TRIPE- scFviCEAS) and 
scFv(CEA6) -TRIPE' scFv (CEA6) . 

5 A plasmid clone, pUC19MCH/CEA6 , generously provided by Dr. 
Kevin Pritchard, Cambridge Antibody Technology Ltd., Mel- 
bourn, UK, harbouring a nucleotide sequence (SEQ ID: 20). 
encoding the CEA6 antibody in single -chain Fv (scFv) format, 
followed in sequence by a "myc tag" (which is a general 

10 purification/detection handle) , was used as template in 
Polymerase Chain Reactions (PCR) (Saiki et al., 1988) in 
which the nucleotide sequence encoding the scFv + myc tag was 
amplified using the primer pairs (SEQ ID: 21 and SEQ ID: 22) 
and (SEQ ID: 21 and SEQ ID: 23) to generate PCR fragments "A" 

15 and "B". 

PCR fragment "A" was treated with restrictions enzymes BamHI 
and Kpnl and the resulting fragment was inserted into 
Bglll/Kpnl cut pTripb (Example 1) to obtain the vector 
pTH6FXscFv(CEA6) -tripb (Fig. 15) encoding the HGFXscFv (CEA6) - 

20 TRIPB fusion protein (Fig. 16) . PCR fragment "B" was treated 
with restriction enzymes BamHI and Hindlll and the resulting 
fragment was inserted into BamHI and Hindlll cut pTripb 
(Example 1) to obtain the vector pTH6FXtripb-scFv(CEA6) (Fig. 
17) encoding the H6FXTRIPB-scFv(CEA6) fusion protein (Fig. 18, 

25 SEQ ID NO: 33) using standard procedures. 

To generate the expression vector pTH6FXscFv(CEA6) -tripb - 
scFv(CEA6) (Fig. 19) encoding the H6FXscFv(CEA6) -TRIPB- 
scFv(CEA6) fusion protein (Fig. 20, SEQ ID NO: 34) the insert 
in the vector pTH6FXtripb-scFv(CEA6) was excised using re- 
30 striction enzymes BamHI and Hindlll and inserted into the 
vector pTH6FXscFv(CEA6) -tripb, which had been treated with 
restriction enzymes BamHI and Hindlll. 
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To prepare the chimeric fusion proteins H6FXscFv(CEA6) -TRIPB 
(SEQ ID NO: 32), H6FXTRIPB-scFv {CEA6) (SEQ ID NO: 33) and 
H6FXscFv(CEA6) -TRIPB-SCFv(CEA6) (SEQ ID NO: 34) the plasmids 
pTH6FXscFv(CEA6) -TRIPB, pTH6FXtripb-scFv(CEA6) and 
5 pTH6FXscFv(CEA6) -tripb- scFv(CEA6) were grown in small scale 
(1 litre; 2xTY medium, 5 mM MgS04 and 100 fig ampicillin) in 
E. coli BL21 cells, as described by Studier et al. (1990). 
Exponentially growing cultures at 37oc were at ODgQo ^-^ 
infected with bacteriophage lambda CE6 at a multiplicity of 

10 approximately 5. Cultures were grown at 37 °C for another 

three hours and the cells harvested by cent rifugat ion. Cells 
were resuspended in 50 ml of 0.5 M NaCl, 50 mM Tris-HCl pH 8, 
and 1 mM EDTA pH 8, Phenol (50 ml adjusted to pH 8) was added 
to each and the mixtures were sonicated to extract total 

15 protein. After clarification by centrifugation (25 minutes at 
10.000 g) crude protein fractions were precipitated from the 
phenol phases by addition of 2.5 volumes of ethanol and 
centrifugation. Protein pellets were dissolved in a buffer 
(15-25 ml) containing 6 M guanidinium chloride, 50 mM 

20 Tris-HCl pH 8 and 0.1 M dithioerythriol . Following gel fil- 
tration on Sephadex G-25 (Pharmacia, Sweden) into 8 M Urea, 1 
M NaCl, 50 mM Tris-HCl pH 8 and 10 mM 2-mercaptoethanol, the 
crude protein preparations were applied to Ni^"*" activated 
NTA-agarose columns (75 ml column volume) for purification 

25 (Hochuli et al . , 1988). Washing buffer (6 M guanidine-HCl, 50 
mM Tris-HCl pH 8 and 10 mM 2-mercaptoethanol) was then flowed 
through the columns until stable baselines were obtained. 
Virtually pure fusion proteins could then be eluted by apply- 
ing a pH gradient to each column (1000 ml gradient in 8 M 

30 urea and 10 mM 2-mercaptoethanol obtained by linear (per 

volume) mixing of solutions containing 50 mM sodium di -hydro - 
genphosphate (pH 5 buffer) and 50 mM di-sodium hydr ogenphos - 
phate (pH 8 buffer) . 

In preparation for in vitro refolding by the method of Th0- 
35 gersen et al. (WO 94/18227) 20 mg of each purified fusion 
protein were mixed in suspensions in refolding "buffer B" 
(described below) with aliquots of suspensions of Ni^^ acti- 
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vated NTA-agarose matrix sufficient to generate columns of 
about 75 ml packed bed volume. Each fusion protein was then 
subjected to the iterative refolding procedure as described 
for plasminogen kringle 4 in the Thogersen et al. patent 
5 application (WO 94/18227) , except that refolding of the scFv 
containing fusion proteins was carried out at lO^'C using a 
buffer containing 0.5 M NaCl, 50 mM Tris-HCl pH 8, 2 mM 
glutathione and 0.2 mM oxidized glutathione as "buffer A" and 
a buffer containing 8 M urea, 1 M NaCl, 50 mM Tris-HCl pH 8 
10 and 2 mM glutathione as "buffer B" . 

After completion of the refolding procedure each column was 
washed with 300 ml buffer containing 0.5 M NaCl and 50 mM 
Tris-HCl pH 8 to wash away glutathione. The refolded fraction 
of each protein was then eluted from the NTA-agarose matrix 

15 by addition of 20 mM EDTA to the elution buffer. After addi- 
tion of solid urea to achieve a final concentration of about 
8 M to each protein sample and dilution or dialysis to reduce 
NaCl concentrations to below 5 mM, final purification of each 
correctly folded fusion protein product was then accomplished 

20 by ion exchange chromatography (S-Sepharose, Pharmacia, 1,6 
(i.d.) by 90 centimeter column in a buffer containing 8 M 
urea, 5 mM Tris-HCl (from 1 M stock solution at pH 8) and 25 
mM sodium acetate (from 1 M stock solution at pH 5) , eluted 
at 2 ml/min) . After dialysis against aqueous buffers (e.g. 

25 phosphate buffered saline) each pure and correctly refolded 
fusion protein was recovered in yields of 2-6 mg per litre of 
culture grown. Each protein may be shown by analytical gel 
filtration, chemical cross-linking analysis, by in vitro 
affinity measurements and by in vivo efficacy to form a 

30 stable homotrimeric molecular complex: The oligomeric status 
of the H6FXtripb-scFv- (CEA6) fusion protein was analyzed by 
chemical cross-linking analysis with DMSI: In parallel ex- 
periments, samples of H6FXtripb-scFv- {CEA6) at 0.34 mg/ml and 
TN123 at 0.28 mg/ml in 0 . 1 M Sodium borate were incubated at 

35 room temperature with increasing amounts (0 - 40 ixg in 12 /xl 
in total) of DMSI for 30 min. Reactions were quenched by 
addition of 5 /xl 3M Tris-HCl pH 9 and the samples analyzed by 
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SDS-PAGE under reducing conditions (Fig. 21) . Like tetranec- 
tin, the H6FXtripb-scFV- (CEA6) fusion protein, of approxi- 
mately 38 kDa, is hereby shown to be a trimer in solution. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT 

(A) NAME: Th0gersen, Hans Christian 

(B) STREET; Ristrupvej 41 

(C) CITY: Mundelstrup 

(D) STATE OR PROVINCE: 

(E) COUNTRY: Denmark 

(F) POSTAL CODE: 8381 

(ii) TITLE OF THE INVENTION: Tritnerising module 



(iii) NUMBER OF SEQUENCES: 34 

(iv) COMPUTER-READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(v) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
CCTGATCAAT CCAGGGAAGA TCTCCTGGTA CCGAGCCACC AACCCAG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCAAGCTTAT TAGGATCCCG TCTGCAGGGC CTG 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCGAAGCTTA TTAGGATCCC TTCAGGGAGA CCGTCTGCAG 
(2) INFORMATION FOR SEQ ID.N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Gly Ser He Gin Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 

15 10 15 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

20 25 30 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 

35 40 45 

Val Ser Leu Lys 
50 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 
15 10 15 
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GO 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

20 25 30 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 
35 40 45 

Val 



(2) INFORMATION FOR SEQ ID NO: 7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Glu Pro Pro Thr 


Gin 


Lys 


Pro 


Lys 


Lys He Val Asn Ala Lys Lys Asp 


1 


5 








10 15 


Val Val Asn Thr 


Lys 


Met 


Phe 


Glu 


Glu Leu Lys Ser Arg Leu Asp Thr 


20 










25 30 


Leu Ala Gin Glu 


Val 


Ala 


Leu 


Leu 


Lys Glu Gin Gin Ala Leu Gin Thr 


35 








40 


45 


Val Cys Leu Lys 


Gly 


Thr 


Lys 


Val 


His Met Lys Cys Phe Leu Ala Phe 


50 






55 




60 


Thr Gin Thr Lys 


Thr 


Phe 


His 


Glu 


Ala Ser Glu Asp Cys He Ser Arg 


65 




70 






75 80 


Gly Gly Thr Leu 


Ser 


Thr 


Pro 


Gin 


Thr Gly Ser Glu Asn Asp Ala Leu 


85 








90 95 


Tyr Glu Tyr Leu 


Arg 


Gin 


Ser 


Val 


Gly Asn Glu Ala Glu He Trp Leu 


100 










105 110 


Gly Leu Asn Asp 


Met 


Ala 


Ala 


Glu 


Gly Thr Trp Val Asp Met Thr Gly 


115 








120 


125 


Ala Arg He Ala 


Tyr 


Lys 


Asn 


Trp 


Glu Thr Glu He Thr Ala Gin Pro 


130 






135 




140 


Asp Gly Gly Lys 


Thr 


Glu 


Asn 


Cys 


Ala Val Leu Ser Gly Ala Ala Asn 


145 




150 






155 160 


Gly Lys Trp Phe 


Asp 
165 


Lys 


Arg 


Cys 


Arg Asp Gin Leu Pro Tyr He Cys 
170 175 


Gin Phe Gly He 


Val 











180 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCTGGATCCA TCGAGGGTAG GGGCGAGCCA CCAACCCAG 



(2) INFORMATION FOR SEQ ID NO: 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCGAAGCTTA CACGATCCCG AACTG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHJ^CTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Ser lie Glu Gly Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Val Arg Ala Asn Lys Arg Asn Glu Ala Leu Arg lie Glu Ser Ala 

15 10 15 

Leu Leu Asn Lys He Ala Met Leu Gly Thr Glu Lys Thr Ala Glu Gly 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Gly Ser His His His His His His Gly Ser 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGAAGCTTA GACCGTCTGC AGGGC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGCGGATCCA TCCAGGGTAG GGTTGTGAAC ACAAAGATG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CCTGGATCCA TCGAGGGTAG GGCCCTGCAG ACGGTC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATGCAGATCT TTGTGAAGAC CCTCACTGGC AAAACCATCA CCCTTGAGGT OGAGCCCAGT . 60 

GACACCATTG AGAATGTCAA AGCCAAAATT CAAGACAAGG AGGGTATCCC ACCTGACCGC 120 

AGCGTCTGAT ATTTGCCGGC AAACAGCTGG AAGATGGACG TACTTTGTCT GACTACAATA 180 

TTCAAAAGGA GTCTACTCTT CATCTTGTGT TGAGACTTCG TGGTGGT 227 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGCTGATCAC AGATCTTTGT GAAGACC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CGCAAGCTTG CATGCTTAGG ATCCACCACG AAGTCTCAA 39 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Gin He Phe Val Lys Thr Leu Thr Gly Lys Thr He Thr Leu Glu 

15 10 15 

Val Glu Pro Ser Asp Thr He Glu Asn Val Lys Ala Lys He Gin Asp 

20 25 30 

Lys Glu Gly He Pro Pro Asp Gin Gin Arg Leu He Phe Ala Gly Lys 

35 40 45 

Gin Leu Glu Asp Gly Arg Thr Leu Ser Asp Tyr Asn He Gin Lys Glu 

50 55 60 

Ser Thr Leu His Leu Val Leu Arg Leu Arg Gly Gly 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CAGGTTCAGC TGCAGCAGTC AGGGGCTGAG GTGAAGAAGC CTGGGTCCTC GGTGAAGGTC 60 



L 
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TCCTGCAAGG CTTCTGGAGG CACCTTCAGC AACTCTCCTA TCAACTGGCT GCGACAGGCC 120 

CCCGGACAAG GGCTTGAGTG GATGGGAAGT ATCATCCCTT CCTTTGGTAC AGCAAACTAC 180 

GCTCAGAAGT TCCAGGGCAG ACTCACX3ATT ACCGCGGACG AATCCACGAG CACAGCCTAC 240 

ATGGAGCTGA GCAGCCTGAG ATCTGAGGAC ACGGCCGTGT ATTACTGTGC GGGGCGGAGC 300 

CACAACTACG AACTCTACTA TTACTACATG GACGTCTGGG GCCAGGGGAC AATGGTCACC 360 

GTCTCGAGTG GTGGAGGCGG TTCAGGCGGA GGTGGCAGCG GCGGTGGCGG ATCGGACATC 420 

CAGATGACCC AGTCTCCTTC CACCCTGTCT GCATCTATTG GAGACAGAGT CACCATCACC 480 

TGCCGGGCCA GTGAGGGTAT TTATCACTGG TTGGCCTGGT ATCAGCAGAA GCCAGGGAAA 540 

GCCCCTAAAC TCCTGATCTA TAAGGCCTCT AGTTTAGCCA GTGGGGCCCC ATCAAGGTTC 600 

AGCGGCAGTG GATCTGGGAC AGATTTCACT CTCACCATCA GCAGCCTGCA GCCTGATGAT 660 

TTTGCAACTT ATTACTGCCA ACAATATAGT AATTATCCGC TCACTTTCGG CGGAGGGACC 720 

AAGCTG6AGA TCAAACGTGC GGCCGCAGAA CAAAAACTCA TCTCAGAAGA GGATCTGAAT 780 

GGGGCC "^^.^ 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGTG6ATCCC AGGTTCAGCT GCAGC 25 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCCGGTACCG GCCCCATTCA GATCC 25 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCCAAGCTTA GGCCCCATTC AGATCC 26 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 

(B) TYPE: amino acid 



L 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Val Arg Ala Asn Lys Arg Asn Glu .Ala Leu Arg He Glu Ser Ala 

1 5 10-15 

Leu Leu Asn Lys He Ala Met Leu Gly Thr Glu Lys Thr Ala Glu Gly 

20 25 30 

Gly Ser His His His His His His Gly Ser He Glu Gly Arg Gly Glu 

35 40 45 

Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp Val 

50 55 60 

Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 
65 70 75 80 

Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr Val 

85 90 95 

Cys Leu Lys Gly Thr Lys Val His Met Lys Cys Phe Leu Ala Phe Thr 

100 105 110 

Gin Thr Lys Thr Phe His Glu Ala Ser Glu Asp Cys He Ser Arg Gly 

115 120 125 

Gly Thr Leu Ser Thr Pro Gin Thr Gly Ser Glu Asn Asp Ala Leu Tyr 

130 135 140 

Glu Tyr Leu Arg Gin Ser Val Gly Asn Glu Ala Glu He Trp Leu Gly 
145 150 155 160 

Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val Asp Met Thr Gly Ala 

165 170 175 

Arg He Ala Tyr Lys Asn Trp Glu Thr Glu He Thr Ala Gin Pro Asp 

180 185 190 

Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser Gly Ala Ala Asn Gly 

195 200 205 

Lys Trp Phe Asp Lys Arg Cys Arg Asp Gin Leu Pro Tyr He Cys Gin 

210 215 220 

Phe Gly He Val 
225 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 197 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Gly Ser His His His His His His Gly Ser He Glu Gly Arg Gly 

15 10 15 

Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 

20 • 25 30 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

35 40 45 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 
50 55 60 
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Val Cys Leu 


Lys 


Gly 


Thr 


Lys 


Val 


His 


Met 


Lys Cys Phe 


Leu 


Ala 


Pile 


65 






70 










75 






ou 


Thr Gin Thr 


Lys 


Thr 


Phe 


His 


Glu 


Ala 


Ser 


Glu Asp Cys 


He 


Ser 


Arg 






85 










90 










Gly Gly Thr 


Leu 


Ser 


Thr 


Pro 


Gin 


Thr 


Gly 


Ser Glu Asn 


Asp 


Ala 


Leu 




100 










105 






110 






Tyr Glu Tyr 


Leu 


Arg 


Gin 


Ser 


Val 


Gly 


Asn 


Glu Ala Glu 


He 


Trp 


Leu 


115 










120 






125 








Gly Leu Asn 


Asp 


Met 


Ala 


Ala 


Glu 


Gly 


Thr 


Trp Val Asp 


Met 


Thr 


Gly 


130 








135 








140 








Ala Arg He 


Ala 


Tyr 


Lys 


Asn 


Trp 


Glu 


Thr 


Glu lie "Thr 


Ala 


Gin 


Pro 


145 






150 










155 








Asp Gly Gly 


Lys 


Thr 


Glu 


Asn 


Cys 


Ala 


Val 


Leu Ser Gly 


Ala 


Ala 


Asn 






165 










170 






175 




Gly Lys Trp 


Phe 


Asp 


Lys 


Arg 


Cys 


Arg 


Asp 


Gin Leu Pro 


Tyr 


He 


Cys 




180 










185 






190 






Gin Phe Gly 


He 


Val 




















195 

























(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Gly Ser His His His His His His Gly Ser He Glu Gly Arg Gly 

15 10 15 

Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala Lys Lys Asp 

20 25 30 

Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr 

35 40 45 

Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala Leu Gin Thr 
50 55 60 

Val 
65 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Val 

15 10 15 

Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg Leu Asp Thr Leu 
20 25 30 
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Ala Gin Glu 


Val 


Ala 


Leu 


Leu 


Lys 


Glu 


Gin Gin Ala Leu um inr vax 


35 










40 






Cys Leu Lys 


Gly 


Thr 


Lys 


Val 


His 


Met 


Lys Cys Pne Leu ax a fne inr 


50 








55 






oU 


Gin Thr Lys 


Thr 


Phe 


His 


Glu 


Ala 


Ser 


Glu Asp Cys He Ser Arg Gly 


65 






70 








nc AH 


Gly Thr Leu 


Ser 


Thr 


Pro 


Gin 


Thr 


Gly 


Ser Glu Asn Asp Aia Leu lyr 




85 










90 95 


Glu Tyr Leu 


Arg 


Gin 


Ser 


Val 


Gly 


Asn 


Glu Ala Glu lie Trp Leu Lr±y 


100 










105 


Tin 
110 


Leu Asn Asp 


Met 


Ala 


Ala 


Glu 


Gly 


Thr 


Trp Val Asp Met Tnr biy Axa 


115 










120 




IOC 

12b 


Arg He Ala 


Tyr 


Lys 


Asn 


Trp 


Glu 


Thr 


Glu lie inr Aia vjjlii fitj 


130 








135 






140 


Gly Gly Lys 


Thr 


Glu 


Asn 


cys 


Ala 


Val 


Leu Ser Gly Ala Ala Asn Gly 


145 






150 








155 160 


Lys Trp Phe 


Asp 


Lys 


Arg 


Cys 


Arg 


Asp 


Gin Leu Pro Tyr He Cys Gin 




165 










170 175 


Phe Gly He 


Val 
180 















(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Ser 

15 10 15 

Pro Gly Thr Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala 

20 25 30 

Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser Arg 

35 40 45 

Leu Asp Thr Leu Ala Gin Glu Val Ala Leu Leu Lys Glu Gin Gin Ala 

50 55 60 

Leu Gin Thr Val Ser Leu Lys Gly Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Ser 
15 10 15 
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Pro Gly Thr Glu Pro Pro Thr Gin 
20 

Lys Lys Asp Val Val Asn Thr Lys 

35 40 
Leu Asp Thr Leu Ala Gin Glu Val 

50 55 
Leu Gin Thr Gly Ser 
65 



Lys Pro Lys Lys lie Val Asn Ala 
25 30 
Met Phe Glu Glu Leu Lys Ser Arg 
45 

Ala Leu Leu Lys Glu Gin Gin Ala 
60 



(2) INFORMATION FOR SEQ ID N0:30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



Met 


Gly 


Ser His 


His 


His 


His 


His His Gly Ser He Glu Gly Arg Ala 


1 






5 






10 15 


Leu 


Gin 


Thr Val 


Cys 


Leu 


Lys 


Gly Thr Lys Val His Met Lys Cys Phe 






20 








25 30 


Leu 


Ala 


Phe Thr 


Gin 


Thr 


Lys 


Thr Phe His Glu Ala Ser Glu Asp Cys 






35 








40 45 


He 


Ser 


Arg Gly 


Gly 


Thr 


Leu 


Ser Thr Pro Gin Thr Gly Ser Glu Asn 




50 








55 


60 


Asp Ala 


Leu Tyr 


Glu 


Tyr 


Leu 


Arg Gin Ser Val Gly Asn Glu Ala Glu 


65 








70 




75 80 


He 


Trp 


Leu Gly 


Leu 


Asn 


Asp 


Met Ala Ala Glu Gly Thr Trp Val Asp 








85 






90 95 


Met 


Thr 


Gly Ala 


Arg 


He 


Ala 


Tyr Lys Asn Trp Glu Thr Glu He Thr 






100 








105 110 


Ma 


Gin 


Pro Asp 


Gly 


Gly 


Lys 


Thr Glu Asn Cys Ala Val Leu Ser Gly 






115 








120 125 


Ala 


Ala 


Asn Gly 


Lys 


Trp 


Phe 


Asp Lys Arg Cys Arg Asp Gin Leu Pro 




130 








135 


140 


Tyr 


He 


Cys Gin 


Phe 


Gly 


He 


Val 


145 








150 







(2) INFORMATION FOR SEQ ID N0:31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Ser 

15 10 15 

Pro Gly Thr Glu Pro Pro Thr Gin Lys Pro Lys Lys He Val Asn Ala 
20 25 30 
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Lys Lys Asp Val Val 

35 

Leu Asp Thr Leu Ala 
50 

Leu Gin Thr Gly Ser 
65 

He Thr Leu Glu Val 
85 

Lys He Gin Asp Lys 
100 

Phe Ala Gly Lys Gin 
115 

He Gin Lys Glu Ser 
130 

Ser 
145 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



Met Gly 


Ser 


His 


His 


His 


His 


His His 


Gly 


Ser 


He Gin Gly Arg Ser 


1 






5 








10 




15 


Gin Val 


Gin 


Leu 


Gin 


Gin 


Ser 


Gly Ala 


Glu 


Val 


Lys Lys Pro Gly Ser 






20 








25 






30 


Ser Val 


Lys 


Val 


Ser 


Cys 


Lys 


Ala Ser 


Gly 


Gly 


Thr Phe Ser Asn Ser 




35 










40 






45 


Pro He 


Asn 


Trp 


Leu Arg 


Gin 


Ala Pro 


Gly 


Gin 


Gly Leu Glu Trp Met 


50 










55 








60 


Gly Ser 


He 


He 


Pro 


Ser 


Phe 


Gly Thr 


Ala 


Asn 


Tyr Ala Gin Lys Phe 


65 








70 








75 


80 


Gin Gly 


Arg 


Leu 


Thr 


He 


Thr 


Ala Asp 


Glu 


Ser 


Thr Ser Thr Ala Tyr 








85 








90 




95 


Met Glu 


Leu 


Ser 


Ser 


Leu 


Arg 


Ser Glu 


Asp 


Thr 


Ala Val Tyr Tyr Cys 






100 








105 






110 


Ala Gly 


Arg 


Ser 


His 


Asn 


Tyr 


Glu Leu 


Tyr 


Tyr 


Tyr Tyr Met Asp Val 




115 










120 






125 


Trp Gly 


Gin 


Gly 


Thr 


Met 


Val 


Thr Val 


Ser 


Ser 


Gly Gly Gly Gly Ser 


130 










135 








140 


Gly Gly 


Gly 


Gly 


Ser Gly 


Gly 


Gly Gly 


Ser 


Asp 


He Gin Met Thr Gin 


145 








150 








155 


160 


Ser Pro 


Ser 


Thr 


Leu 


Ser 


Ala 


Ser He 


Gly 


Asp 


Arg Val Thr He Thr 








165 








170 




175 


Cys Arg 


Ala 


Ser 


Glu Gly 


He 


Tyr His 


Trp 


Leu 


Ala Trp Tyr Gin Gin 






180 








185 






190 


Lys Pro 


Gly 


Lys 


Ala 


Pro 


Lys 


Leu Leu 


He 


Tyr 


Lys Ala Ser Ser Leu 




195 










200 






205 


Ala Ser 


Gly 


Ala 


Pro 


Ser 


Arg 


Phe Ser 


Gly 


Ser 


Gly Ser Gly Thr Asp 


210 










215 








220 



Asn Thr 

Gin Glu 

55 
Gin He 
70 

Glu Pro 
Glu Gly 
Leu Glu 
Thr Leu 



Lys Met 

40 

Val Ala 

Phe Val 

Ser Asp 

He Pro 
105 
Asp Gly 
120 

His Leu 



Phe Glu Glu 

Leu Leu Lys 
60 

Lys Thr Leu 
75 

Thr He Glu 
.90 

Pro Asp Gin 
Arg Thr Leu 
Val Leu Arg 



Leu Lys 

45 

Glu Gin 
Thr Gly 
Asn Val 

Gin Arg 

110 
Ser Asp 
125 

Leu Arg 



Ser Arg 

Gin Ala 

Lys Thr 

80 
Lys Ala 
95 

Leu He 
Tyr Asn 
Gly Gly 
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Phe 


Thr 


Leu 


Thr 


He 


Ser 


Ser 


Leu Gin 


Pro Asp 


Asp 


Phe Ala Thr Tyr 


225 










230 








235 




240 


Tyr 


Cys 


Gin 


Gin 


Tyr 


Ser 


Asn 


Tyr Pro 


Leu 


Thr 


Phe Gly Gly Gly Thr 






245 








250 






255 


Lys 


Leu 


Glu 


He 


Lys 


Arg 


Ala 


Ala Ala 


Glu 


Gin 


Lys 


Leu He Ser Glu 






260 








265 








270 


Glu Asp 


Leu 


Asn 


Gly 


Ala 


Gly 


Thr Glu 


Pro 


Pro 


Thr 


Gin Lys Pro Lys 






275 










280 








285 


Lys 


He 


Val 


Asn 


Ala 


Lys 


Lys 


Asp Val 


Val 


Asn 


Thr 


Lys Met Phe Glu 




290 










295 








300 




Glu 


Leu 


Lys 


Ser 


Arg 


Leu 


Asp 


Thr Leu 


Ala 


Gin 


Glu 


Val Ala Leu Leu 


305 










310 








315 




320 


Lys 


Glu 


Gin Gin 


Ala 


Leu 


Gin 


Thr Gly 


Ser 














325 








330 









(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



Met Gly 


Ser 


His 


His 


His 


His 


His His Gly Ser 


He Gin Gly Arg Ser 


1 






5 








10 


15 


Pro Gly 


Thr 


Glu 


Pro 


Pro 


Thr 


Gin Lys 


Pro Lys 


Lys He Val Asn Ala 




20 








25 




30 


Lys Lys 


Asp 


Val 


Val 


Asn 


Thr 


Lys Met 


Phe Glu 


Glu Leu Lys Ser Arg 


35 










40 




45 


Leu Asp 


Thr 


Leu 


Ala 


Gin 


Glu 


Val Ala 


Leu Leu 


Lys Glu Gin Gin Ala 


50 










55 






60 


Leu Gin 


Thr 


Gly 


Ser 


Gin 


Val 


Gin Leu 


Gin Gin 


Ser Gly Ala Glu Val 


65 






70 






75 


80 


Lys Lys 


Pro 


Gly 


Ser 


Ser 


Val 


Lys Val 


Ser Cys 


Lys Ala Ser Gly Gly 






85 








90 


95 


Thr Phe 


Ser 


Asn 


Ser 


Pro 


He 


Asn Trp 


Leu Arg 


Gin Ala Pro Gly Gin 






100 








105 




110 


Gly Leu 


Glu 


Trp 


Met 


Gly 


Ser 


He He 


Pro Ser 


Phe Gly Thr Ala Asn 




115 










120 




125 


Tyr Ala 


Gin 


Lys 


Phe 


Gin 


Gly 


Arg Leu 


Thr He 


Thr Ala Asp Glu Ser 


130 










135 






140 


Thr Ser 


Thr 


Ala 


Tyr Met 


Glu 


Leu Ser 


Ser Leu 


Arg Ser Glu Asp Thr 


145 








150 






155 


160 


Ala Val 


Tyr 


Tyr 


Cys 


Ala 


Gly 


Arg Ser 


His Asn 


Tyr Glu Leu Tyr Tyr 




165 








170 


175 


Tyr Tyr 


Met 


Asp 


Val Trp 


Gly 


Gin Gly Thr Met 


Val Thr Val Ser Ser 






180 








185 




190 


Gly Gly 


Gly 


Gly 


Ser Gly 


Gly 


Gly Gly Ser Gly 


Gly Gly Gly Ser Asp 




195 










200 




205 


He Gin 


Met 


Thr 


Gin 


Ser 


Pro 


Ser Thr 


Leu Ser 


Ala Ser He Gly Asp 


210 










215 






220 


Arg Val 


Thr 


He 


Thr Cys 


Arg 


Ala Ser Glu Gly 


He Tyr His Trp Leu 


225 








230 






235 


240 
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Ala Trp Tyr 


Gin 


Gin Lys 


Pro Gly Lys 


Ala Pro 


Lys Leu Leu He Tyr 




245 




250 




Lys Ala Ser 


Ser 


Leu Ala 


Ser Gly Ala 


Pro Ser 


Arg Phe Ser Gly Ser 


260 




265 




270 


Gly Ser Gly 


Thr 


Asp Phe 


Thr Leu Thr 


He Ser 


Ser Leu oin rro /vsp 


275 






280 




285 


Asp Phe Ala 


Thr 


Tyr Tyr 


Cys Gin Gin 


Tyr Ser 


Asn Tyr Pro Leu Thr 


290 






295 




300 


Phe Gly Gly 


Gly 


Thr Lys 


Leu Glu lie 


Lys Arg 


Ala Ala Ala Glu Gin 


305 




310 




315 


320 


Lys Leu lie 


Ser 


Glu Glu 


Asp Leu Asn 


Gly Ala 






325 




330 





(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Gly Ser His His His His His His Gly Ser He Gin Gly Arg Ser 

15 10 15 

Gin Val Gin Leu Gin Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ser 

20 25 30 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Ser Asn Ser 

35 40 45 

Pro He Asn Trp Leu Arg Gin Ala Pro Gly Gin Gly Leu Glu Trp Met 

50 55 60 

Gly Ser He He Pro Ser Phe Gly Thr Ala Asn Tyr Ala Gin Lys Phe 
65 70 75 80 

Gin Gly Arg Leu Thr He Thr Ala Asp Glu Ser Thr Ser Thr Ala Tyr 

85 90 95 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 

100 105 110 

Ala Gly Arg Ser His Asn Tyr Glu Leu Tyr Tyr Tyr Tyr Met Asp Val 

115 120 125 

Trp Gly Gin Gly Thr Met Val Thr Val Ser Ser Gly Gly Gly Gly Ser 

130 135 140 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp He Gin Met Thr Gin 
145 150 155 160 

Ser Pro Ser Thr Leu Ser Ala Ser He Gly Asp Arg Val Thr He Thr 

165 170 175 

Cys Arg Ala Ser Glu Gly He Tyr His Trp Leu Ala Trp Tyr Gin Gin 

180 185 190 

Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Lys Ala Ser Ser Leu 

195 200 205 

Ala Ser Gly Ala Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp 

210 215 220 

Phe Thr Leu Thr He Ser Ser Leu Gin Pro Asp Asp Phe Ala Thr Tyr 
225 230 235 240 

Tyr Cys Gin Gin Tyr Ser Asn Tyr Pro Leu Thr Phe Gly Gly Gly Thr 
245 250 255 
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Lys Leu Glu 

Glu Asp Leu 
275 

Lys He Val 

290 
Glu Leu Lys 

305 

Lys Glu Gin 

Ser Gly Ala 

Lys Ala Ser 
355 

Gin Ala Pro 

370 
Phe Gly Thr 
385 

Thr Ala Asp 

Arg Ser Glu 

Tyr Glu Leu 
435 

Val Thr Val 

450 
Gly Gly Gly 
465 

Ala Ser He 

He Tyr His 

Lys Leu Leu 
515 

Arg Phe Ser 

530 
Ser Leu Gin 
545 

Asn Tyr Pro 
Ala Ala Ala 



He Lys Arg Ala Ala Ala Glu Gin Lys 
260 265 
Asn Gly Ala Gly Thr Glu Pro Pro Thr 
280 

Asn Ala Lys Lys Asp Val Val Asn Thr 
295 300 
Ser Arg Leu Asp Thr Leu Ala Gin Glu 

310 315 
Gin Ala Leu Gin Thr Gly Ser Gin Val 

325 330 
Glu Val Lys Lys Pro Gly Ser Ser Val 
340 345 
Gly Gly Thr Phe Ser Asn Ser Pro He 
360 

Gly Gin Gly Leu Glu Trp Met Gly Ser 
375 380 
Ala Asn Tyr Ala Gin Lys Phe Gin Gly 

390 395 
Glu Ser Thr Ser Thr Ala Tyr Met Glu 

405 410 
Asp Thr Ala Val Tyr Tyr Cys Ala Gly 
420 425 
Tyr Tyr Tyr Tyr Met Asp Val Trp Gly 
440 

Ser Ser Gly Gly Gly Gly Ser Gly Gly 
455 460 
Ser Asp He Gin Met Thr Gin Ser Pro 

470 475 
Gly Asp Arg Val Thr He Thr Cys Arg 

485 490 
Trp Leu Ala Trp Tyr Gin Gin Lys Pro 
500 505 
He Tyr Lys Ala Ser Ser Leu Ala Ser 
520 

Gly Ser Gly Ser Gly Thr Asp Phe Thr 
535 540 
Pro Asp Asp Phe Ala Thr Tyr Tyr Cys 

550 555 
Leu Thr Phe Gly Gly Gly Thr Lys Leu 

565 570 
Glu Gin Lys Leu He Ser Glu Glu Asp 
580 585 



Leu He Ser Glu 

270 

Gin Lys Pro Lys 
285 

Lys Met Phe Glu 

Val Ala Leu Leu 
320 

Gin Leu Gin Gin 

335 

Lys Val Ser Cys 
350 

Asn Trp Leu Arg 
365 

He He Pro Ser 

Arg Leu Thr He 
400 

Leu Ser Ser Leu 
415 

Arg Ser His Asn 
430 

Gin Gly Thr Met 
445 

Gly Gly Ser Gly 

Ser Thr Leu Ser 
480 

Ala Ser Glu Gly 
495 

Gly Lys Ala Pro 
510 

Gly Ala Pro Ser 
525 

Leu Thr He Ser 

Gin Gin Tyr Ser 

560 

Glu He Lys Arg 
575 

Leu Asn Gly Ala 
590 
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1. A monomer polypeptide construct comprising at least one 
tetranectin trimerising structural element (TTSE) which is 
covalently linked to at least one heterologous moiety, said 
5 TTSE being capable of forming a stable complex with two other 
TTSEs, with the proviso that the heterologous moiety is 
different from any of the fusion proteins CIIH6FXTN123, 
H6FXTN123, H6FXTN12, H6FXTN23 , the sequences of which are 
shown in SEQ ID NOs : 24-27. 

10 2. A monomer polypeptide construct comprising at least one 
tetranectin trimerising structural element (TTSE) which is 
covalently linked to at least one heterologous moiety, said 
TTSE being capable of forming a stable complex with two other 
TTSEs, said at least one heterologous moiety being one which 

15 does not exclusively facilitate expression and/or purifica- 
tion of the monomer polypeptide construct. 

3. A monomer polypeptide construct according to claim 1 or 2, 
wherein the stable complex includes a triple alpha helical 
coiled coil. 

20 4. A monomer polypeptide construct according to any of the 

preceding claims, wherein the heterologous moiety is selected 
from the group consisting of a ligand binding structure; a 
toxin; a detectable label; an in situ activatable substance; 
an enzyme; a radioactive moiety; a cytokine; a non-protein- 

25 aceous polymer such as a polymeric alkaloid, a polyalcohol, a 
polysaccharide, a lipid and a polyamine; a photo cross -lin- 
king agent; and a group facilitating conjugation of the 
monomer polypeptide construct to a target. 

5. A monomer polypeptide construct according to any of the 
30 preceding claims, which comprises 2 TTSEs which are covalent- 
ly linked by a spacer moiety which allows both of the 2 TTSEs 
to take part in complex formation with a third TTSE not being 
part of the monomer polypeptide construct. 
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6. A monomer polypeptide construct according to claim 5, 
wherein the spacer moiety has a length and a conformation 
which favours complex formation involving both of the two 
TTSEs which are covalently linked by the spacer moiety . 

5 7 . A monomer polypeptide construct according to claim 5 or 6 , 
wherein the spacer moiety is a polypeptide fragment. 

8. A monomer polypeptide construct according to any of claims 
1-4, which comprises one single TTSE. 

9. A monomer polypeptide construct according to any of the 
10 preceding claims, wherein the TTSE is derived from human 

tetranectin, murine tetranectin, C-type lectin of bovine 
cartilage, or C-type lectin of shark cartilage. 

10. A monomer polypeptide construct according to claim 9, 
wherein the TTSE comprises a polypeptide sequence which has 

15 at least 68% sequence identity with the consensus sequence 
shown in Fig ,2. 

11. A monomer polypeptide construct according to claim 10, 
wherein the sequence identity with the consensus sequence is 
at least 75%, such as at least 81%, at least 87%, or at least 

20 92%. 

12. A monomer polypeptide construct according to any of 
claims 9-11, wherein at least one amino acid residue selected 
from the group consisting of amino acid residue nos. 6, 21, 
22, 24, 25, 27, 28, 31, 32, 35, 39, 41, 42, is/are substitu- 

25 ted by any non-helix breaking amino acid residue, the amino 
acid residue numbering referring to amino acid residues in 
SEQ ID NO: 7. 

13 . A monomer polypeptide construct according to any of the 
preceding claims, wherein the at least one TTSE comprises a 

30 repeated heptad having the formula a-b-c-d-e-f -g (N to C) , 



wo 98/56906 PCT/DK98/00245 

75 

wherein a majority of the amino acids residues a and d are 
hydrophobic amino acids . 

14. A monomer polypeptide construct according to claim 13, 
wherein heptad is repeated 3 times and wherein the last 

5 occurrence of the heptad has a glutamine residue correspon- 
ding to residues a and d. 

15. A monomer polypeptide construct according to any of the 
preceding claims, wherein the at least one heterologous 
moiety is covalently linked to the TTSE via a peptide bond to 

10 the N- or C-terminus of the TTSE peptide chain, via a peptide 
bond to a side chain in the TTSE, via a bond to a cysteine 
residue, or when more than one heterologous moity, 
combinations of these locations . 

16 . A monomer polypeptide construct according to any of the 
15 preceding claims which lacks any free amino and/or carboxy 

groups . 

17. A monomer polypeptide construct according to any of the 
preceding claims which lacks a substantial part of the N- 
terminal region of tetranectin which is encoded by exon 1. 

20 18. A monomer polypeptide construct according to any of the 
preceding claims comprising two heterologous moieties which 
are linked via peptide bonds to the N- and C-terminus, re- 
spectively. 

19. A monomer polypeptide construct according to any of the 
25 preceding claims which is constructed so as to disfavour 

formation of complexes between identical TTSEs. 

20. An oligomer which is comprised of two monomer poLypeptide 
constructs according to any of claims 1-19, and which com- 
prises at three TTSE's or a multiplum of three TTSE's, or 

30 which is comprised of three monomer polypeptide constructs 
according to any of claims 1-4 or 8-19. 
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21. An oligomer according to claim 20 which is stable in the 
temperature range 50-70**C. 

22. An oligomer according to claim 20 or 21, which comprises 
at least one heterologous moiety which is positioned N-termi- 

5 nally to a TTSE and at least one heterologous moiety which is 
positioned C-terminally to a TTSE. 

23. An oligomer according to claim 22, wherein the at least 
one heterologous moiety which is positioned N- terminally to a 
TTSE and the at least one heterologous moiety which is posi- 

10 tioned C- terminally to a TTSE are part of the same monomeric 
polypeptide construct. 

24. An oligomer according to claim 22, wherein the at least 
one heterologous moiety which is positioned N-terminally to a 
TTSE and the at least one heterologous moiety which is posi- 

15 tioned C-terminally to a TTSE are part of two separate mono- 
meric polypeptide constructs. 

25. An oligomer according to any of claims 20-24, wherein 
each monomer polypeptide construct is designed so as to 
disfavour formation of trimers including two monomer polypep- 

20 tide constructs having identical TTSEs. 

26. A method of preparing a monomer polypeptide construct 
according to any of claims 1-19, the method comprising 

isolating the monomer polypeptide construct from a cul- 
ture comprising a host cell which carries and expresses a 
25 nucleic acid fragment which encodes the monomer polypep- 

tide construct, or 

synthesizing, by means of chemical peptide synthesis, the 
monomer polypeptide construct and subsequently isolating 
the monomer polypeptide construct from the reaction 
30 mixture, or 
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preparing a TTSE in a culture comprising a host cell 
which carries and expresses a nucleic acid fragment which 
encodes the TTSE, subsequently linking covalently at 
least one heterologous moiety to the TTSE, and thereafter 
5 isolating the resulting monomer polypeptide construct, or 

synthesizing, by means of chemical peptide synthesis, a 
TTSE, subsequently linking covalently at least one hete- 
rologous moiety to the TTSE, and thereafter the isolating 
the resulting monomer polypeptide construct from the 
10 reaction mixture, 

and optionally subjecting the monomer polypeptide construct 
to further processing. 

27. A method for preparing a dimeric oligomer according to 
claim 20 which comprises 

15 - admixing a monomer polypeptide construct according to any 
of claims 1-19 which includes two TTSEs (construct 1) 
with a monomer polypeptide construct according to any of 
claims 1-4 or 8-19 which includes only one TTSE (con- 
struct 2) , 

20 - effecting the two TTSE's of construct 1 to complex with 
the TTSE of construct 2, and 

isolating the resulting dimer and optionally subjecting 
the dimer to further processing. 

28. A method for preparing a trimeric oligomer according to 
25 claim 20 which comprises 

admixing three monomer polypeptide constructs according 
to any of claims 1-19 with each other, 

effecting complex formation between one TTSE of each 
monomer polypeptide construct, and 
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isolating the resulting trimer and optionally, subjecting 
the trimeric oligomer to further processing. 



29, A kit comprising 

a first package comprising at least one container means, 
5 each at least one container means containing a monomer 

polypeptide construct according to any of claims 1-19, 

a second package comprising at least one container means, 
each at least one container means in the second package 
containing a monomer polypeptide construct according to 
10 any of claims 1-19, the second package being different 

from the first package with respect to choice and/or 
number of monomer polypeptide constructs included there- 
in, and optionally 



a third package comprising at least one container means, 
15 each at least one container means in the third package 

containing a monomer polypeptide construct according to 
any of claims 1-19, the second package being different 
from the first and second packages with respect to choice 
and/or number of monomer polypeptide constructs included 
20 therein. 



30. A kit according to claim 29, wherein the at least one 
container means in each package contains mutually distinct 
monomer polypeptide constructs. 

31. A kit according to claim 29 or 30, wherein all container 
25 means comprised in the kit comprises mutually distinct poly- 
peptide constructs . 

32. A nucleic acid fragment in isolated form which encodes a 
TTSE as defined in any of claims 1-19 or which encodes the 
polypeptide part of a monomer polypeptide construct according 

30 to any of claims 1-19, with the proviso that the nucleic acid 
fragment is different from one that encodes native members of 



wo 98/56906 PCT/DK98/00245 

79 

the tetranectin family, and that the nucleic acid fragment is 
different from one that encodes any of the fusion proteins 
CIIH6FXTN123, H6FXTN123, H6FXTN12, H6FXTN23, the sequences of 
which are shown in SEQ ID NOs: 24-27. 

5 33. A replicable vector which comprises a nucleic acid frag- 
ment according to claim 32. 

34. A transformed host cell, which comprises a nucleic acid 
fragment according to claim 32 or a replicable vector accor- 
ding to claim 32 . 

10 35. Use of a monomer polypeptide construct according to any 
of claims 1-19 or to a an oligomer construct according to any 
of claims 20-25 for targeted gene therapy involving selective 
delivery of the material for transfection or infection of the 
specific population of cells. 

15 36. The use according to claim 35 wherein the at least one 
heterologous moity comprises a moiety selected from a ligand 
binding structure such as a receptor molecule or the ligand 
binding part of a receptor molecule, and wherein the gene 
therapy involves the delivery of nucleic acids to the desired 

20 population of cells by use of a viral vector directed to 
cells displaying the artificial receptor complex 
corresponding to the heterologous moity. 

37. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a an oligomer according to any of 

25 claims 20-25 as a component of a chimaeric product having low 
antigenicity in humans relative to formulations comprising on 
or more components of non-human origin. 

38. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a an oligomer according to any of 

30 claims 20-25 as a vehicle for assembling antibody fragments 
into oligomeric or multivalent entities for generating 
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chimeric artificial antibodies having preselected 
pharmacokinetic and/or pharmadynamic properties. 

39. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a an oligomer according to any of 

5 claims 20-25 for delivering an imaging or toxin -conjugated 
antibody to a tumor. 

40. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 
claims 20-25 as a vehicle delivering an substance to a target 

10 cell or tissue. 

41. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 
claims 20-25 for a labelled construct wherein the label is 
coupled to one or to of the TTSE monomer units. 

15 42 . The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 
claims 20-25 for protein library technology, such as phage 
display technology. 

43 . The use according to claim 42 comprising a poly 
20 nucleotide molecule encoding one or more TTSE. 

44. The use according to claim 43 comprising a vector 
encoding one or more TTSE. 

45. The use of a monomer polypeptide construct according to 
any of claims 1-19 or to a oligomer according to any of 

25 claims 20-25 for the preparation of a pharmacutical 
composition. 

46. The use according to any to claim 45, wherein the 
pharmaceutical composition further comprises a 
pharmaceutically acceptable excipient. 
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47. The use according to claim 45 or 46 wherein the 
pharmaceutical composition is administered by a route 
selected from the group consisting of the intraveneous route, 
the intraarterial route, the transmembraneus route of the 

5 buccal, anal, vaginal or conjunctival tissue, the intranasal 
route, the pulmonary route, the transdermal route, the 
intramuscular routed, subcutaneous route, intratechal route, 
inoculation into tissue such as a tumour, or by an implant. 

48. The use according to any of claims 35 to 47 

10 wherein the monomer polypeptide construct according to any of 
claims 1-19 or the oligomer according to any of claims 20-25 
is comprised in a liposome 

49. A method for treating or preventing of a disease 
comprising administering to the subject in need thereof an 

15 effective amount of a pharmceutical composition as defined in 
any of claims 45 and 46. 

50 . A method for treating or preventing a disease comprising 
administering to the subject in need thereof an effective 
amount of a relevant pharmceutical coupled to a monomer 

20 polypeptide construct according to any of claims 1-19 or to a 
oligomer according to any of claims 20-25. 

51. A method for targeted gene therapy comprising use of a 
monomer polypeptide construct according to any of claims 1-19 
or to a oligomer according to any of claims 20-25. 

25 52. A method of human gene therapy comprising use of a 

monomer polypeptide construct according to any of claims 1-19 
or to an oligomer according to any of claims 20-25 wherein at 
least one TTSE is modifyed with a membrane integrating or 
associating entity having affinity to the specific population 

30 of cells in the body relevant for the gene therapy. 

53. A method according to any of claims 49 to 52 



wo 98/56906 PCT/DK98/00245 

82 

wherein the monomer polypeptide construct according to any of 
claims 1-19 or the oligomer according to any of claims 20-25 
is administered by a route selected from the group consisting 
of the intraveneous route, the intraarterial route, the 
5 transmembraneus route of the buccal, anal og vaginal tissue, 
intranasal route, the pulmonary route, the transdermal route, 
intramuscular, subcutaneous, intratechal, the buccal, 
inoculation into tissue such as a tumour, or by an implant. 

54. A method for prevention and/or treating a disease, 
10 comprising administering to a mammal in need thereof a 

prophylactically or therapeutically effective amount of a 
construct comprising the monomer polypeptide construct 
according to any of claims 1-19 or the oligomer according to 
any of claims 20-25. 

15 55. A method for diagnosis comprising a construct comprising 
the monomer polypeptide construct according to any of claims 
1-19 or the oligomer according to any of claims 20-25 
together with a diagnosing component coupled thereon. 
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H6FXtripa fusion protein 

1 MGSHHHHHHGSIQGRSPGTEPPTQKPKKIV 30 

31 NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 

61EQQALQTVSLKGS* 73 



HGFXtripB fusion protein 

1 MGSHHHHHHGSIQGRSPGTEPPTQKPKKIV 30 

31 NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 

61EQQALQTGS* 69 



Fig. 4 
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CilH6FXTN123 fusion protein 

1 MVRANKRNEALRIES 

31 EGGSHHHHHHGSIEG 

61 KKDVVNTKMFEELKS 

91 QALQTVCLKGTKVHM 

121 SEDCISRGGTLSTPQ 

151 VGNEAEIWIiGLNDMA 

181 KNWETEITAQPDG6K 

211 FDKRCRDQLPYICQF 



ALLNKIAMLGTEKTA 30 
RGEPPTQKPKKIVNA 60 
RLDTLAQEVALLKEQ 90 
KCFLAFTQTKTFHEA 120 
TGSENDALYEYLRQS150 
AEGTWVDMTGARIAY180 
TENCAVLSGAANGKW 210 
G I V * 228 



H6FXTN123 fusion protein 



1 


MGSHHHHHH 


G 


S 


I E G R 


31 


KDVVNTKMF 


E 


E 


L K S R 


61 


AliQTVCLKG 


T 


K 


V H M K 


91 


EDCISRGGT 


L 


S 


T P Q T 


121 


GNEAEXWLG 


L 


N 


D M A A 


151 


NWETEITAQ 


P 


D 


G G K T 


181 


DKRCRDQLP 


Y 


I 


C Q F G 



GEPPTQKPKKIVNAK 30 
LDTLAQEVALLKEQQ 60 
CFLAFTQTKTFHEAS 90 
GSENDALYEYIiRQSV120 
EGTWVDMTGARIAYK150 
ENCAVLSGA A.N G K W F 180 
I V * 



Fig. 6 
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TN12 



GSIEGRGEPP 
GATCCATCGAGGGTAGGGGC6AGCCACCA — 



Q T V * 
--CAGACGGTCTA 



TN23 



GSIQGRVVNT 
GATCCATCCAGGGTAGGGTTGTGAACACA - — 



G I V * 
• GGGATCGT6TA 



TN3 



GSIEGRALQ GIV* 
GATCCATCGAGGGTAGGGCCCTGCAG GGGATCGTGTA 



MGSHHHHHH 
CATATGGGATCGCATCACCATCACCATCACG - - 

Bam H! 



Phage T7 
promoter 



- AGCTTGAATTC 
Hind lit EcoH 




Fig. 7 
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H6FXTN12 fusion protein 

1 MGSHHHHHHGSIEGRGEPPTQKPKKIVNAK 30 

31 KDVVNTKMPEELKSRLDTLAQEVALLKEQQ 60 

61 A L Q T V * ^5 



H6FXTN23 fusion protein 

1 MGSHHHHHHGSIQGRVVNTKMFEELKSRLD 30 

31 TLAQEVALLKEQQALQTVCLKGTKVHMKCF 60 

61 LAFTQTKTFHEASEDCISRGGTLSTPQTGS 90 

91 ENDALYEyLRQSVGNEAEIWLGLNDMAAEG120 

121 TWVDMTGARIAYKNWETEITAQPDGGKTEN150 

151 CAVLSGAANGKWFDKRCRDQLPYICQFGIV180 

181 * 

H6i^N3 fusion protein 

1 MGSHHHHHHGSIEGRALQTVCLKGTKVHMK 30 

31 CFLAFTQTKTFHEASEDCISRGGTLSTPQT 60 

61 GSENDALYEYLRQSVGNEAEIWLGLNDMAA 90 

91 EGTWVDMTGARIAYKNWETEITAQPDGGKT120 

121 ENCAVLSGAANGKWFDKRCRDQLPYICQFG150 

151 I V * 



Fig. 8 
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Fig. 9 
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H6FXtripb-UB fusion protein 



1 MGSHHHHHHGSIQGRSPGTEPPTQKPKKIV 30 

31 NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 

61 EQQALQTGSQIFVKTLTGKTITLEVEPSDT 90 

91 IENVKAKIQDKEGIPPDQQRLIFAGKQLED120 

121 GRTLSDYNIQKESTLHLVLRLRGGS*145 
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H6FXscFv(CEA6)-tripb fusion protein 

IMGSHHHHHHGSIQGRSQVQLQQSGAEVKKP 30 
31GSSVKVSCKASGGTFSNSPINWLRQAPGQG 60 
61LEWMGSIIPSFGTANYAQKFQGRLTITADE 90 
91STSTAYMELSSLRSEDTAVYYCAGRSHNYE 120 
121 LYYYYMDVWGQGTMVTVSSGGGGSGGGGSG 150 
151 GGGSDIQMTQSPSTLSASIGDRVTITCRAS IBO 
181 EGIYHWLAWYQQKPGKAPKLLIYKASSLAS 210 
211 GAPSRFSGSGSGTDFTIiTISSLQPDDFATY 240 
241 YCQQYSNYPLTFGGGTKLEIKRAAAEQKLI 270 
271 SEEDLNGAGTEPPTQKPKKIVNAKKDVVNT 300 
301 KMFEELKSRLDTLAQEVALLKEQQALQTGS 330 
331 * 
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H6FXtripb-scFv(CEA6) fusion protein 

IMGSHHHHHHGSIQGRSPGT E P PTQKPKKIV 30 
31NAKKDVVNTKMFEELKSRLDTLAQEVALLK 60 
61EQQALQTGSQVQLQQSGAEVKKPGSSVKVS 90 
91CKASGGTFSNSPINWLRQAPGQGLEWMGSI120 
121IPSFGTANYAQKFQGRLTITADESTSTAYM150 
151ELSSLRSEDTAVYYCA6RSHNYELYYYYMD180 
181 VWGQGTMVTVSSGGGGSGGGGSGGGGSDIQ 210 
211 MTQSPSTLSASIGDRVTITCRASEGIYHWL 240 
241 AWYQQKPGKAPKLLIYKASSLASGAPSRFS 270 
271 GSGSGTDFTLTISSLQPDDFATYYCQQYSN 300 
301 YPLTFGGGTKLEIKRAAAEQKLISEEDLNG 330 
331 A * 
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H6FXscFv(CEA6Hripb*scFv(CEA6) fusion protein 

IMGSHHHHHHGSIQGRSQVQLQQSGAEVKKP 30 
31GSSVKVSCKASGGTFSNSPINWLRQAPGQG 60 
61LEWMGSIIPSFGTANYAQKFQGRLTITADE 90 
91STSTAYMELSSLRSEDTAVYYCAGRSHNYE 120 
121 LYYYYMDVWGQGTMVTVSSGGGGSGGGGSG 150 
151 GGGSDIQMTQSPSTLSASIGDRVTITCRAS 180 
181EGIYHWLAWYQQKPGKAPKLLIYKASSLAS 210 
211 GAPSRFSGSGSGTDFTLTISSLQPDDFATY 240 
241 YCQQYSNYPLTFGGGTKIiEIKRAAAEQKLI 270 
271SEEDLNGAGTEPPTQKPKKIVNAKKDVVNT300 
301KMFEELKSRIiDTLAQEVALLKEQQAIiQTGS330 
331QVQLQQSGAEVKKPGSSVKVSCKASGGTFS360 
361NSPINWLRQAPGQGLEWMGSIIPSFGTANY390 
391 AQKFQGRLTITADESTSTAYMELSSLRSED 420 
421TAVYYCAGRSHNYELYYYYMDVWGQGTMVT450 
451 VSSGGGGSGGGGSGGGGSDIQMTQSPSTLS 480 
481ASIGDRVTITCRASEGIYHWLAWYQQKPGK510 
511APKLLIYKASSLASGAPSRFSGSGSGTDFT540 
541LTISSLQPDDFATYYCQQYSNYPLTFGGGT570 
571KLEIKRAAAEQKLISEEDLNGA* 592 
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